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Preface 


if 


Some of the central questions of mathematical logic are: What is a 
mathematical proof? How can proofs be justified? Are there limitations 
to provability? To what extent can machines carry out mathematical 
proofs? 

Only in this century has there been success in obtaining substantial 
and satisfactory answers, the most pleasing of which is given by Gédel’s 
completeness theorem: It is possible to exhibit (in the framework of 
first-order languages) a simple list of inference rules which suffices to 
carry out all mathematical proofs. ''Negative'' results, however, appear 
in Gédel’s incompleteness theorems. They show, for example, that it is 
impossible to prove all true statements of arithmetic, and thus they reveal 
principal limitations of the axiomatic method. 

This book begins with an introduction to first-order logic and a proof of 
Gédel’s completeness theorem. There follows a short digression into model 
theory which shows that first-order languages have some deficiencies in 
expressive power. For example, they do not allow the formulation of 
an adequate axiom system for arithmetic or analysis. On the other hand, 
this difficulty can be overcome—even in the framework of first-order 
logic—by developing mathematics in set-theoretic terms. We explain the 
prerequisites from set theory that are necessary for this purpose and then 
treat the subtle relation between logic and set theory in a thorough manner. 

Godel’s incompleteness theorems are presented in connection with 
several related results (such as Trahtenbrot's theorem) which all exemplify 
the limitations of machine oriented proof methods. The notions of com- 
putability theory that are relevant to this discussion are given in detail. The 
concept of computability is made precise by means of a simple programming 
language. 


The development of mathematics in the framework of first-order logic (as 
indicated above) makes use of set-theoretic notions to an extent far beyond 
that of mathematical practice. As an alternative one can consider logical 
systems with more expressive power. We introduce some of these systems, 
such as second-order and infinitary logics. In each of these cases we point 
out deficiencies contrasting first-order logic. Finally, this empirical fact is 
confirmed by Lindstrom's theorems, which show that there is no logical 
system that extends first-order logic and at the same time shares all its 
advantages. 

The book does not require special mathematical knowledge; however, it 
presupposes an acquaintance with mathematical reasoning as acquired, for 
example, in the first year of a mathematics or computer science curriculum. 
Exercises enable the reader to test and deepen his understanding of the text. 
The references in the bibliography point out essays of historical importance, 
further investigations, and related fields. 

The original edition of the book appeared in 1978 under the title 
**Einfiihrung in die mathematische Logik." Some sections have been revised 
for the present translation; furthermore, some exercises have been added. 
We thank Dr. J. Ward for his assistance in preparing the final English 
text. Further thanks go to Springer-Verlag for their friendly cooperation. 


Freiburg and Aachen H.-D. EBBINGHAUS 
November 1983 J. FLUM 
W. THOMAS 
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PART A 


CHAPTER I 
Introduction 


Towards the end of the nineteenth century mathematical logic evolved into 
a subject of its own. It was the works of Boole, Frege, Russell, and Hilbert, 
among others,’ that contributed to its rapid development. Various elements 
of the subject can already be found in traditional logic, for example, in the 
works of Aristotle or Leibniz. However, while traditional logic can be 
considered as part of philosophy, mathematical logic is more closely related 
to mathematics. Some aspects of this relation are: 


(1) Motivation and Goals. Investigations in mathematical logic arose mainly 
from questions concerning the foundations of mathematics. For example, 
Frege intended to base mathematics on logical and set-theoretical principles. 
Russell tried to eliminate contradictions that arose in Frege's system. 
Hilbert's goal was to show that "the generally accepted methods of mathe- 
matics taken as a whole do not lead to a contradiction" (this is known as 
Hilbert's program). 


(2) Methods. In mathematical logic the methods used are primarily 
mathematical. This is exemplified by the way in which new concepts are 
formed, definitions are given, and arguments are conducted. 


(3) Applications in Mathematics. The methods and results obtained in 
mathematical logic are not only useful for treating foundational problems; 
they also increase the stock of tools available in mathematics itself. There are 
applications in many areas of mathematics, such as algebra and topology. 


‘ Aristotle (384-322 sc.), G. W. Leibniz (1646-1716), G. Boole (1815-1864), G. Frege (1848- 
1925), D. Hilbert (1862-1943), B. Russell (1872-1970). 
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However, these mathematical features do not result in mathematical 
logic being of interest solely to mathematicians. For example, the mathe- 
matical approach leads to a clarification of concepts and problems that also 
are of importance in traditional logic and in other fields, such as epistemology 
or the philosophy of science. In this sense the restriction to mathematical 
methods turns out to be very fruitful. 

In mathematical logic, as in traditional logic, deductions and proofs are 
central objects of investigation. However, it is the methods of deduction 
and the types of argument as used in mathematical proofs which are con- 
sidered in mathematical logic (cf. (1)). In the investigations themselves, 
mathematical methods are applied (cf. (2)). This close relationship between 
the subject and the method of investigation, particularly in the discussion 
of foundational problems, may create the impression that we are in danger 
of becoming trapped in a vicious circle. We shall not be able to discuss this 
problem in detail until Chapter VII, and we ask the reader who is concerned 
about it to bear with us until then. 


| An Example from Group Theory 


In this and the next section we present two simple mathematical proofs. 
They serve as illustrations of some of the methods of proof as used by 
mathematicians. Guided by these examples we raise some questions which 
lead us to the main topics of the book. 

We begin with the proof of a theorem from group theory. We therefore 
require the axioms of group theory, which we now state. We use ° to denote 
the group multiplication and e to denote the identity element. The axioms 
may then be formulated as follows: 


(G1) For all x, y,z: (xoy)oz =x0(¥°2). 
(G2) Forallx: xc°e =x. 
(G3) For every x there isa y such that xo y =e. 


A group is a triple (G, °°, e®) which satisfies (G1), (G2), and (G3). Here G 
is a set, e° is an element of G, and -“ isa binary function on G, Le., a function 
defined on all pairs of elements from G, the values of which are also elements 
of G. The variables x, y, z range over elements of G, © refers to o%, and e 
refers to e°. 

As an example of a group we mention the additive group of reals (R, +, 0), 
where R is the set of real numbers, + is the usual addition, and 0 is the real 
number zero. On the other hand, (R, -, 1) is not a group (where . is the usual 
multiplication). For example, the real number 0 violates axiom (G3): there 
is no real number r such that 0-r = 1. 

We call triples such as (R, +, 0) or (R, -, 1) structures. In Chapter III we 
shall give an exact definition of the notion of structure. 


§2. An Example from the Theory of Equivalence Relations 


Now we prove the following simple theorem from group theory: 


1.1 Theorem (Existence of a Left Inverse). For every x there is a y such that 
yox =e, 


ProoF. Let x be chosen arbitrarily. From (G3) we know that, for a suitable y, 


(1) xoy=e. 

Again from (G3) we get, for this y, an element z such that 

(2) yess 

We can now argue as follows: ‘ 
yox =(yox)ece (by (G2)) 


=(yex)e(yoz) (from (2)) 
= yo(xe(yoz)) (by (GI)) 
= yo((xey)ez) (by(Gl)) 


= yo(eoz) (from (1)) 
=(ycoe)oz (by (G1)) 
= yoz (by (G2)) 
= <¢ (from (2)). 


Since x was arbitrary, we conclude that for every x there is a y such that 
yox =e, C 


The proof shows that in every structure where (G1), (G2), and (G3) are 
satisfied, 1.e., in every group, the theorem on the existence of a left inverse 
holds. A mathematician would also describe this situation by saying that the 
theorem on the existence of a left inverse follows from, or is a consequence of 
the axioms of group theory. 


§2. An Example from the Theory of 
Equivalence Relations 


The theory of equivalence relations is based on the following three axioms 
(xRy is to be read "x is equivalent to y"): 


(El) For all x: xRx. 
(E2) For all x and y: If xRy, then yRx. 
(E3) For all x, y, z: If xRy and yRz, then xRz, 


Let A be a nonempty set, and let R* be a binary relation on A, i.e, 
R4 cA x A. For (a,b) € R* we also write aR“b. The pair (A, R“) is another 
example of a structure. We call R* an equivalence relation on A, and the 
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structure (A, R*)an equivalence structure if (E1),(E2), and (E3) are satisfied. 
For example, (Z, R,) is an equivalence structure, where Z is the set of integers 
and 


R, = {(a,b)|a,b € Z and b — ais divisible by 5). 


On the other hand, the binary relation R,, on Z, which holds between two 
integers if they are relatively prime, is not an equivalence relation over Z. 
For example, 5 and 7 are relatively prime, and 7 and /5 are relatively prime, 
but 5 and /5 are not relatively prime; thus (£3)does not hold for R,,. 

We now prove a simple theorem about equivalence relations. 


21 Theorem. If x and y are both equivalent to a third element, they are 
equivalent to the same elements. More formally, for all x and y, if there is au 
such that xRu and yRu, then for all z, xRz if and only if yRz. 


Proor. Let x and y be given arbitrarily; suppose that for some u 
(1) xRu and yRu. 
From (E2)we then obtain 

(2) uRx and uRy. 
From xRu and uRy we deduce, using (E3), 

(3) XRy, 

and from yRu and uRx we likewise get (using (E3)) 
(4) yRx. 

Now let z be chosen arbitrarily. If 

(S) xRz 

then, using (E3), we obtain from (4) and (5) 


yRzZ. 
On the other hand, if 
(6) yRz 
then, using (E3), we get from (3)and (6) 
xRz. 
Thus the claim is proved for all z. C 


As in the previous example, this proof shows that every structure (of the 
form (A, R“)) which satisfies the axioms (E/), (E2), and (E3), also satisfies 
Theorem 2./, ie., that 2./ follows from (E1),(E2), and (E3). 


§3. A Preliminary Analysis 


§3. A Preliminary Analysis 


We sketch some aspects which the two examples just given have in common. 
In each case one starts from a system ® of propositions which is taken to be a 
system of axioms for the theory in question (group theory, theory of equiv- 
alence relations). The mathematician is interested in finding the propositions 
which follow from ®, where a proposition yw is said to follow from © if w 
holds in every structure which satisfies all propositions in ®. A proof of & 
from a system ® of axioms shows that ¥ follows from ®. 

When we think about the scope of methods of mathematical proof, we 
are led to ask about the converse: J 


(*)  Isevery proposition y which follows from ® also provable from 
? 


For example, is every proposition which holds in all groups also provable 
from the group axioms (G1), (G2), and (G3)? 

The material developed in Chapters Hf through V and in Chapter VII 
yields an essentially positive answer to (*). Clearly it is necessary to make the 
concepts "proposition", “follows from", and "provable", which occur in 
(*), more precise. We sketch briefly how we shall do this. 


(1) The Concept "Proposition". Normally the mathematician uses his 
everyday language (e.g., English or German) to formulate his propositions. 
But since sentences in everyday language are not, in general, completely 
unambiguous in their meaning and structure, we cannot specify them by 
precise definitions. For this reason we shall introduce a formal language L 
which reflects features of mathematical statements. Like programming 
languages used today, L will be formed according to fixed rules: Starting 
with a set of symbols (an "alphabet"'), we obtain so-called formulas as finite 
symbol strings built up in a standard way. These formulas correspond to 
propositions expressed in everyday language. For example, the symbols of 
L will include ¥ (to be read "for all"), A ("and"), — ("if... then"), = 
("equal"), and variables like x, y, and z. Formulas of L will be expressions 
like 


Vex =x X = y, X =Z, 


and 


Il! 


Vx Vy Vz(x Sy Ay =Z)> x = Z). 

Although the expressive power of L may at first appear to be limited, we 
shall later see that many mathematical propositions can be formulated in L. 
We shall even see that L is in principle sufficient for all of mathematics. The 
definition of L will be given in Chapter I]. 
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(2) The Concept "Follows From" (the Consequence Relation). Axioms 
(G1)-(G3) of group theory obtain a meaning when interpreted in structures 
of the form (G, °%, e®). In an analogous way we can define the general notion 
of an L-formula holding in a structure. This enables us (in Chapter III) to 
define the consequence relation: $follows jrom (is a consequence of) ® if 
and only if w holds in every structure where all formulas of ® hold. 


(3) The Concept "Proof ”. A mathematical proof of a proposition yw from 
a system ® of axioms consists of a series of inferences which proceeds from 
axioms of ® or propositions that have already been proved to new proposi- 
tions, and which finally ends with w. At each step of a proof the mathematician 
writes something like "From... and one obtains directly that - 
and he expects it to be clear to anyone that the validity of... and of 

entails the validity of : 

An analysis of examples shows that the grounds for accepting such 
inferences are often closely related to the meaning of connectives, such as 
"and", "or", or "if-then", and quantifiers, for all" or "there exists", which 
occur there. For example, this is the case in the first step of the proof of 1.1, 
where we deduce from "for all x there isa y such that xo y = e” that for the 
given x there isa y such that xo y = e. Or consider the step from (1) and (2) 
to (3) in the proof of 2.1, where from the proposition “xRu and yRu” we 
infer the left member of the conjunction, “xRu”, and from “uRx and uRy” 
we infer the right member, “uRy”, and then using (E3) we conclude (3). 

The formal character of the language L makes it possible to represent these 
inferences as formal operations on symbol strings (the L-formulas). Thus, 
the inference of “xRu” from “xRu and yRu” mentioned above ccrresponds 
to the passage from the L-formula (xRu a yRu) to xRu. We can view this 
as an application of the following rule: 


(+)  Itis permissible to pass from an L-formula of the form (g A w) 
to the L-formula @. 


In Chapter IV we shall give a finite system S of rules which, like (+), corre- 
spond to elementary inference steps the mathematician uses in his proofs. A 
formal proof of the L-formula y% from the L-formulas in ® (the "axioms" 
consists then (by definition) of a sequence of formulas in L which ends with 
yw, and in which each L-formula is obtained by application of a rule from S 
to the axioms or to preceding formulas in the sequence. 

Having introduced the precise notions, one can convince oneself by 
examples that mathematical proofs can be imitated by formal proofs in L. 
Moreover, in Chapter V we shall return to the question (*) and answer it 
positively, showing that if a formula yw follows from a set ® of formulas, then 
there is a proof of ¥ from ®, even a formal proof. This is the content of the 
so-called Gédel completeness theorem. 


Preview 
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Gédel’s completeness theorem forms a bridge between the notion of proof, 
which is formal in character, and the notion of consequence, which refers 
to the meaning in structures. In Chapter VI we shall show how this connec- 
tion can be used in algebraic investigations. 

Once a formal language and an exact notion of proof have been introduced, 
we have a precise framework for mathematical investigations concerning, 
for instance, the consistency of mathematics or a justification of rules of 
inference used in mathematics (Chapters VII and X). 

Finally, the formalization of the notion of proof creates the possibility of 
using a computer to carry out or check proofs. In Chapter X we shall discuss 
the range and the limitations of such machine-oriented methods. 

In the formulas of L the variables refer to the elements of a structure, for 
example, to the elements of a group or the elements of an equivalence 
structure. In a given structure we often call elements of the domain A first- 
order objects, while subsets of A are called second-order objects. Since L 
only has variables for first-order objects (and thus expressions such as 
“Wx” and “dx” apply only to the elements of a structure), we call L a first- 
order language. 

Unlike L, the so-called second-order language also has variables which 
range over subsets of the domain of a structure. Thus a proposition about a 
given group which begins "For all subgroups...” can be directly formulated 
in the second-order language. We shall investigate this language and others 
in Chapter IX. In Chapter XII we shall be able to show that no language with 
more expressive power than L enjoys both an adequate formal concept of 
proof and other useful properties of L. From this point of view L is a “best- 
possible" language, and so we succeed in justifying the dominant réle which 
the first-order language plays in mathematical logic. 


CHAPTER II 
Syntax of First-Order Languages 


In this chapter we introduce the first-order languages. They obey simple, 
clear formation rules. In later chapters we shall discuss whether and to what 
extent all mathematical propositions can be formalized in such languages. 


1 Alphabets 


By an alphabet A we mean a nonempty set of symbols. Examples of alphabets 
are the sets A, = {0, 1, 2,...,9}, A, = {a, b,c,..., x, y, z} (the alphabet of 
lower-case letters), A, = {¢, {, a, d, x, f,),@Q,and A, = {cg, Cy, C2,...}. 

We call finite sequences of symbols from an alphabet A strings or words 
over A. A* denotes the set of all strings over A. The length of a string ¢ € A* 
is the number of symbols, counting repetitions, occurring in ¢. The empty 
string is also considered to be a word over A. It is denoted by UL), and its 
length is zero. 

Examples of strings over A, are 


softly, xdbxaz. 


Examples of strings over A, are 


[109 dx xs (fas. 


Suppose A = {|, ||}, that is, A consists of the symbols! a, =| and a, := |]. 
Then the string {|| over A can be read three ways: as @,4,@,, aS a,a,, and as 


! Here we write "a, :=|” instead of ''a, = |” in order to make it clear that a, is defined by the 
right-hand side of the equation. 
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a,a,. In the sequel we shall allow only those alphabets A where any string 
over A can be read in exactly one way. The alphabets A,,..., Ag given above 
satisfy this condition. 

We now turn to questions concerning the number of strings over a given 
alphabet. 

We call a set M countable if it is not finite and if there is a surjective map 
a of the set of natural numbers N = {0, 1, 2,...} onto M. We can then 
represent M as {a(m)|7eN} or, if we write the arguments as indices, as 
{a, |e N). A set M is called at most countable if it is finite or countable. 


1.1 Lemma. For a nonempty set M the following are equivalent: 


(a) M is at most countable. 
(b) There is a surjective map a: N — M. 
(c) There is an injective map §: M - N. 


Proor.” We shall prove (b) from (a), (c) from (b), and (a) from (c). 
(b) from (a): Let M be at most countable. If M is countable (b) holds by 
definition. For finite M, say M = {a,, ...,a,) (M is nonempty), we define 


a: NM by 
a; fOTiln, 
a(i) = 
ag otherwise. 


a is Clearly surjective. 
(c) from (b): Let a: N-M _ be surjective. We define an injective map 
B:M — N by setting, for a € M, 


B(a) := the least i such that «(i) = a. 


(a) from (c): Let 8: M > N be injective and suppose M is not finite. We 
must show that M is countable. To do this we define a surjective map 
a: N — M inductively as follows: 


a(0) = the a € M with the smallest image under f in N, 
a(n + 1):=the a € M with the smallest image under f greater 


than B(a(0)), ..., B(a(n)). 


Since the images under f are not bounded in N, a is defined for all ne N, 
and clearly every a € M belongs to the image of a. L 


Every subset of an at most countable set is at most countable. If M, and 
M, are at most countable, then so is M, u M,. The set R of real numbers 
is neither finite nor countable: it is uncountable. 


2 The goal of our investigations is, among other things, a discussion of the notion of proof. 
Therefore the reader may be surprised that we use proofs before we have made precise what a 
mathematical proof is. As already mentioned in Chapter I, we shall return to this apparent 
circularity in Chapter VII. 


12 II. Syntax of First-Order Languages 


We shall later show that finite alphabets suffice for representing mathe- 
matical statements. Moreover, the symbols may be chosen as "concrete" 
objects such that they can be included on the keyboard of a typewriter. 
Often, however, one can improve the transparency of an argument by using 
a countable alphabet such as A,, and we shall do this frequently. For some 
mathematical applications of methods of mathematical logic it is also 
useful to consider uncountable alphabets. The set {c,|r € R}, which contains 
a symbol c, for every real number r, is an example of an uncountable alphabet. 
We shall justify the use of such alphabets in VII.4. 


1.2 Lemma. /f A is an at most countable alphabet, then the set A* of strings 
over A is countable. 


PRoor. Let p, be the nth prime number: pp = 2, p, = 3, p2 = 5, and so on. 
If A is finite, say A = {a,, ...,a,), where a,, ...,a, are pairwise distinct, 
or if A is countable, say A = {a,, a,, a,, ...), where the a; are pairwise 
distinct, we can define the map B: A* — N by 


AQ) = 1, Bla; .-- a) = Pei’ *....pr'. 


Clearly f is injective and thus (cf. 1.1(c)) A* is at most countable. Since 
a,, a,a,, a,a,a,, ...areallin A* itcannot be finite; hence it iscountable. LJ 


1.3 Exercise. Let a: N — R be given. For a, be R such that a < bshow that 
there is a point c in the interval I = [a, b] such that c ¢ {a(m)|ne N}. Con- 
clude from this that I, and hence RK also, are uncountable. (Hint: By induction 
define a sequence I = I, > I, >... of closed intervals such that a(n) ¢ 1,4, 
and use the fact that (\,en L # @.) 


1.4 Exercise. Show that if M, and M, are countable sets and M, (CM, ¢ 
M,, then M, is also countable. 


1.5 Exercise. (a) Show that if the sets My, M,,... are at most countable 
then the union | };.,, M; is alsa.at most countable. 
(b) Use (a) to give a different proof of Lemma 1.2. 


§2. The Alphabet of a First-Order Language 


We wish to construct formal languages in which we can formulate, for 
example, the axioms, theorems, and proofs about groups and equivalence 
relations which we considered in Chapter I. In that context the connectives, 
the quantifiers, and the equality relation played an important réle. Therefore, 


§2. The Alphabet of a First-Order Language 13 


we shall include the following symbols in the first-order languages: — (for 
"not"), A (for and"), v (for "or"), > (for "if-then"'), <> (for "if and only 
if"), ¥ (for "for all"), 3 (for "there exists"), = (as symbol for equality). To 
these we shall add variables (for elements of groups, elements of equivalence 
structures, etc.) and finally parentheses as auxiliary symbols. In order to 
formulate the axioms for groups we also need certain symbols peculiar to 
group theory, e.g., a binaryfunction symbol, say °, to denote the group multi- 
plication, and a symbol, say e, to denote the identity element. We call e a 
constant symbol, or simply, a constant. For the axioms of the theory of 
equivalence relations we need a binary relation symbol, say R. 

Thus, in addition to the "logical" symbols such as “7” and “ A ”, we 
shall need a set S of relation symbols, function symbols, and constants which 
varies from theory to theory. Each such set S of symbols determines a 
first-order language. 

We summarize: 


2.1 Definition. The alphabet df a jirst-order language contains the following 


symbols: 

(a) U9, U4, 02,... (variables); 

(b) 4, 4 v,, © (not, and, or, if-then, if and only if); 
(c) V,3 (for all, there exists); 

(d) = (equality symbol); 

(e) ),¢ (parentheses); 


(f) (1) for every n > 1 a (possibly empty) set of n-ary relation symbols; 
(2) for every n > 1 a (possibly empty) set of n-aryfunction symbols; 
(3) a (possibly empty) set of constants. 


We shall denote by A the set of symbols listed in (a) through (e), and by 
S the set of symbols from (f). S may be empty. The symbols listed under (f) 
must, of course, be distinct from each other and from the symbols in A. 

S determines a first-order language (cf. §3). We call As:= A uS the 
alphabet of this language and S its symbol set. 

We have already become acquainted with some symbol sets: S,, := {9, e} 
for group theory and S,,:= {R )for the theory of equivalence relations. For 
the theory of ordered groups we could use {o, e, R), where the binary relation 
symbol R is now taken to represent the ordering relation. In certain theo- 
retical investigations we shall use the symbol set S,,, which contains the 
constants Cg, Cy, Cz,..., and for every n= 1 the countably many n-ary 
relation symbols Ro, Rj, R3,... and n-ary function symbolsf 5,f4, /3,.--. 

Henceforth we shall use the letters P, Q, R,... to stand for relation 
symbols, f, g, h,... for function symbols, c, ¢g, c;,... for constants, and 
X, y, Z,... for variables. 
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§3. Terms and Formulas in First-Order Languages 


Given a symbol set S, we call certain strings over A, formulas of the first-order 
language determined by S. For example, if S = S,, we want the strings 


€ = 8, €ov7; = tp, dvy(e =e an vy = v2) 


to be formulas, but not 


The formulas e =e and ecv, = vz have the form of equations. Mathe- 
maticians call the strings to the left and to the right of the equality symbol 
terms. Terms are "meaningful" combinations of function symbols, variables, 
and constants (together with commas and parentheses). Clearly, to give a 
precise definition of formulas and thus, in particular, of equations, we must 
first specify more exactly what we mean by terms. In mathematics terms 
are written in different notations, such as x + e, g(x, e), gxe. We choose a 
parenthesis-free notation system, as in gxe. 

To define the notion of term we give instructions (or rules) which tell us 
how to generate the terms. (Such a system of rules is often called a calculus.) 
This is more precise than a vague description, and simpler than an explicit 
definition. 


3.1 Definition. S-terms are precisely those strings in Af whichcan be obtained 
by finitely many applications of the following rules: 


(T1) Every variable is an S-term. 

(T2) Every constant in S is an S-term. 

(T3) If the strings tg,...,¢,- 4 are S-terms andf is an n-ary function symbol 
in S, then ft, ...t,—, is also an S-term. 


We denote the set of S-terms by T°. 


Iff is a unary and g a binary function symbol and S = {f, g,c, R}, then 
Qo fgvac 


is an S-term. First of all, c is an S-term by (T2) and vp and vy are S-terms by 
(T1). If we apply (T3) to the S-terms v4 and c and to the function symbol g, 
we see that gv4c isan S-term. Another application of (T3) to the S-term gv,c 
and to the function symbol f shows that fgv,c is an S-term, and a final 
application of (T3) to the S-terms vg and fgv,c and to the function symbol 
g shows that guy fgvac is an S-term. 

We say that one can derive the string gvp fgv4c¢ in the calculus of terms 
(corresponding toS). The derivation just described can be given schematically 
as follows: 
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Le (T2) 

2. Ug (T1) 

3. Dy (T1) 

4. gugc (T3) applied to 3 and 1 using g 
5. fgvgc (T3) applied to 4 usingf 

6. gvo fgv4c (T3) applied to 2 and 5 using g. 


The string directly following the number at the beginning of each line can 
be obtained in each case by applying a rule of the calculus of terms; applica- 
tions of (T3) use terms obtained in preceding lines. The information at the 
end of each line indicates which rules and preceding terms were used. Clearly, 
not only the string in the last line, but all strings in preceding lines can be 
derived and hence are S-terms. 

The reader should show that the strings gxgxfy and gxgfxfy are S-terms 
for arbitrary variables x and y. Here we give a derivation to show that the 
string oxcey is an S,,-term. 


Ls & (T1) 
229 (T1) 
3, e (T2) 


4, cey (T3) applied to 3 and 2 using 
5. oxcey (T3) applied to | and 4 using ». 


Mathematicians usually write the term in line 4 as ec y, and the term in 
line 5 as x o (ec y). For easier reading we shall sometimes write terms in this 
way as well. 

Using the notion of term we are now able to give the definition of formulas. 


3.2 Definition. S-formulas are precisely those strings of A¥ which are obtained 
by finitely many applications of the following rules: 


(F1) If t) and t, are S-terms, then tg = t, is an S- formula. 

(F2) If to,...,¢,-, are S-terms and R is an n-ary relation symbol from S, 
then Rt, ...¢,_, is an S-formula. 

(F3) If m is an S-formula, then “@ is also an S-formula. 

(F4) If pand wy are S-formulas, then (g a $),(g v $), (gy > W), and (eo W) 
are also S-formulas. 

(F5) If @ is an S-formula and x is a variable, then Vx@ and Jxq@ are also 
S-formulas. 


S-formulas derived using (F1) and (F2) are called atomic formulas because 
they are not formed by combining other S-formulas. —@ is called the 
negation of g, and (g a W), (@ v §$), and (@ > W) are called, respectively, 
the conjunction, disjunction, and implication of g and w. 

We use L* to ‘denote the set of S-formulas. L* is the jirst-order language 
corresponding to the symbol set S (often called the language d first-order 
predicate calculus corresponding to S). 
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Instead of S-terms and S-formulas, we often speak simply of terms and 
formulas when the reference to S is either clear or unimportant. For terms 
we use the letters ¢, fg, f;,..., and for formulas the letters ¢, y,... 

We now give some examples Let S=S, = {R}. We can express the 
axioms for the theory of equivalence relations by the following formulas: 


Vtg Rvp U9, 
Vv9 VU, (Rv9 0, > Rv,U9), 
Vvo Vo, Vo2((Rugt, A Rv,v2) > Rvot>). 


One can verify that these strings really are formulas by giving appropriate 
derivations (as was done above for terms) in the calculus of S,,-formulas. 
For the first two formulas we have, for example, 


(1) 1. Rug tv (F2) 
2. YUg RUp Uo (F5) applied to 1 using V, tg. 
(2) 1. Rvgv, (F2) 
2. Rv» v (F2) 
3. (Rtugv, > Rv,vo) (F4) applied to 1 and 2 using > 
4. Vu,(Rugv, > Rv,vo) (F5) applied to 3 using VY, v, 
5. VWvo Vu (Rvugv, > Rvyv 9) (F5) applied to 4 using Y, vo. 


In a similar way the reader should convince himself that, for unary f, binary g, 


unary P, ternary Q, and variables x, y, and z, the following strings are 
{ f, g, P, Q)-formulas : 


(1) Vy(Pz > Qxxz); 
(2) (Pgxfy ~ Ix(x = x A x Sx)); 
(3) Vz Vz dz Qxyz. 


In spite of its rigor the calculus of formulas has "liberal" aspects: we can 
quantify over a variable which does not actually occur in the formula in 
question (as in (1)), we can join two identical formulas by means of a.con- 
junction (as in (2)), or we can quantify several times over the same variable 
(as in (3)). 

For the sake of clarity we shall frequently use an abbreviated or modified 
notation for terms and formulas. For example, we shall write the S,,-formula 
Rvgv, aS vp Rv, (compare this with the notation 2 < 3). Moreover, we shall 
often omit parentheses if they are not essential in order to avoid ambiguity, 
e.g., the outermost parentheses surrounding conjunctions, disjunctions, etc. 
Thus we may write g a W for (g a w). In the case of iterated conjunctions 
or disjunctions we shall agree to associate to the left,e.g..@ a wa x will be 
understood to mean ((g A W) A x). The reader should always be aware that 
expressions in the abbreviated form are no longer formulas in the sense of 
3.2. We emphasize once again that we need an exact definition of formulas 
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in order to have a precise notion of mathematical statement in our analysis 
of the notion of proof. 

Perhaps the following analogy with programming languages will clarify 
the situation. When writing a program one must be meticulous in following 
the grammatical rules for the programming language, because a computer 
can process only a formally correct program. But programmers use an 
abbreviated notation when devising or discussing programs in order to 
express themselves more quickly and clearly. 

We have used = for the equality symbol in first-order languages in 
order to make statements of the form @ = x = y (“q@ is the formula x = y") 
easier to read. 

For future use we note the following: 


3.3 Lemma. If S is at most countable, then T* and L* are countable. 


ProoF. If S is at most countable, then so is A, and hence by 1.2 the set A$. 
Since T° and L* are subsets of A¥ they are also at most countable. On the 
other hand T° and L* are infinite because T* contains the variables vo, v,, 
v,,...,and L* contains the formulas v9 = v9, 0; = U1, U2 = U,... (even if 


S = 9). O 


With the preceding observations the languages L* have become the 
object of investigations. In these investigations we use another language, 
namely everyday English augmented by some mathematical terminology. 
In order to emphasize the difference in the present context, the formal 
language L°, being discussed, is called the object language; the language 
English, in which we discuss, is called the metalanguage. In another context, 
for example, in linguistic investigations, everyday English could be an object 
language. Similarly, first-order languages can play the réle of metalanguages 
in certain set-theoretical investigations (cf. VII.4.3). 


Historical Note. G. Frege [10] developed the first comprehensive formal 
language. He used a two-dimensional system of notation which was so 
complicated that his language never came into general use. The formal 
languages used today are based essentially on those introduced by G. Peano 


[22]. 


$4. Induction in the Calculus of Terms and in 
the Calculus of Formulas 


Let S be a set of symbols and let Z < A€ be a set of strings over Ag. In the 
case where.Z = T° or Z = L* we described the elements of Z by means of a 
calculus. Each rule of such a calculus either says that certain strings belong 
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to Z (e.g., the rules (T1), (T2), (F1), and (F2)), or else permits the passage 
from certain strings ¢9,..-,¢,-, to a new string € in the sense that, if 
Cor+++5Cn—-1 all belong to Z, then ¢ also belongs to Z. The way such rules 
work is made clear when we write them schematically, as follows: 


Cos-++> Ont 
C 
By allowing n = 0, the first sort of rule mentioned above ("premise-free" 


rules) is included in this scheme. 
Now we can write the rules for the calculus of terms as follows: 


(T1) —_, 
x 
(T2) ae ifceS; 


(73) eee eer 


flgusten iff ¢ S$ and f is n-ary. 


When we define a set Z of strings by means of a calculus C we can then prove 
assertions about the elements of Z by means of induction over C. This principle 
of proof corresponds to induction over the natural numbers. If one wants to 
show that all elements of Z have a certain property P, then it is sufficient to 
show that 


(for every rule 


(*) 

) ; 
of the calculus C, the following holds: whenever f5,.--5f,—y 
are derivable in C and have the property P ("induction hypo- 
thesis"), then { also has the property P. 


Hence in case n = 0 we must show that € has the property P. 

This principle of proof is evident: in order to show that all strings de- 
rivable in C have the property P, we show that everything derivable by 
means of a ''premise-free" (i.e., n = 0 in (*)) rule has the property P, and 
then that P is preserved under application of the remaining rules. The method 
can also be justified using the principle of complete induction for natural 
numbers. For this purpose, one defines in the obvious way the length of a 
derivation in C (cf. the examples of derivations in §3), and then argues as 
follows: If the condition (I) is satisfied for P, one shows by induction on n 
that every string which has a derivation of length n has the property P. 
Since every element of Z has a derivation of some finite length, P must then 
hold for all elements of Z. 

In the special case where C is the calculus of terms or the calculus of 
formulas, we call the proof procedure outlined above proof by induction on 
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terms or on formulas, respectively. In order to show that all S-terms have a 
certain property P it is sufficient to show: 


(T1) Every variable has the property P. 

(T2) Every constant in S has the property P. 

(T3)’ If the S-terms to,...,f,-1 have the property P, and if f €S is n-ary, 
then ft, ...t,—, also has the property P. 


In the case of the calculus of formulas the corresponding conditions are 


(F1)’ Every S-formula of the form tg = f£, has the property P. 

(F2)’ Every S-formula of the form Rt, ...t,_, has the property P. 

(F3)' If the S-formula @ has the property P, then “@ also has the property P. 

(F4)’ If the S-formulas g and w have the property P, then (g a $), ( v §), 
(~ > Ww), and (g + wW) also have the property P. 

(F5) If the S-formula @ has the property P and ff x is a variable, then Vx 
and 4x@ also have the property P. 


We now give some applications of this method of proof. 


4.1 Lemma. (a) For all symbol sets S the empty string LJ is neither an S-term 
nor an S-formula. 

(b) (1) © is not an S,,-term. 
(2) cov, is not an S,,-term. 

(c) For all symbol sets S, every S-formula contains the same number of right 
parentheses “)” as left parentheses “(”. 


Proor. (a) Let P be the property on A¥ which holds for a string ¢ iff ¢ is 
nonempty. We show by induction on terms that every S-term has the property 
P, and leave the proof for formulas to the reader. 


(T1)' and (T2)’: Terms of the form x or c are nonempty. 


(T3)’: Every term formed according to (T3) begins with a function symbol, 
and hence is nonempty. (Note that we do not need to use the induction 
hypothesis.) 

(b) We leave (1) to the reader. To prove (2), let P be the property on 
Ag,, which holds for a string ¢ over Ag, iff ¢ is distinct from ecv,. We shall 
show by induction on terms that every S,,-term is distinct from cev,. The 
reader will notice that we start using a more informal presentation of in- 
ductive proofs. 


t = x,t = e: t is distinct from the string oov,. 


= of,t,: If off, were cov, then, by (a), we would havet, = cand t, = v4. 
But tf; = e contradicts (1). 
(c) First one shows by induction on terms that no S-term contains a left 


or right parenthesis. Then one considers the property P over A*, which 
holds for a'string ¢ over A\s iff¢ has the same number of left parentheses as 
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right parentheses, and one shows by induction on formulas that every 
S-formula has the property P. We give some cases here as examples: 


( = to =t,, where ty and f, are S-terms: By the observation above there 
are no parentheses in ¢, thus P holds for g¢. 


~ = 1, where y has the property P by induction hypothesis: Since @ 
does not contain any parentheses except those in w, ¢~ also has the property P. 


w =( A x), where P holds for y and yx by induction hypothesis: Since 
~ contains one left parenthesis and one right parenthesis in addition to the 
parentheses in w and y, the property P also holds for g. 


oy = Vx, where y has the property P by induction hypothesis: The proof 
here is the same as in the case y = “1. L 


Next we want to show that terms and formulas have a unique decomposi- 
tion into their constituents. The following two lemmas contain some pre- 
liminary results needed for this purpose. We refer to a fixed symbol set S. 


4.2 Lemma. (a) For all terms t and t', t is not a proper initial segment oft' 
(i.e., there is no ¢ distinct from LJ such that t€ = t’). 


(b) For all formulas ~ and ¢', ~ is not a proper initial segment of ©’. 


We confine ourselves to the proof of (a), and consider the property P, 
which holds for a string 7 iff 


(*) for all terms t', 7 is not a proper initial segment oft' and f’ is not 

a proper initial segment of 7. 
Using induction on terms, we show that all terms ¢ have the property P. 
(a) will then be proved. 


t = x: Let t’ be an arbitrary term. By 4.1(a), t’ cannot be a proper initial 
segment of x, for then t’ would have to be the empty string L]. On the other 
hand, one can easily show by induction on terms that x is the only term which 
begins with the variable x. Therefore, cannot bea proper initial segment oft’. 


t 


c: The argument is similar. 


t = fto...t,-, and (*) holds for tp,..., t,-,: Let ¢ be an arbitrary-fixed 
term. We show that t’ cannot be a proper initial segment of t. Otherwise 
there would be a € such that 


(1) C¢#U0), t=. 


Since ¢’ begins with f (for ¢ begins with) ,t' cannot bea variable or a constant, 
thus t' must have been generated using (T3). Therefore it has the form 
fto-.-t,—, for suitable terms tg... t,-1- From (/) we have 


(2) fie ectgag = ftp aed 


Induction in the Calculus of Terms and in the Calculus of Formulas 21 


and from this, cancelling the symbol f, we obtain 
(3) ies ele a Heme Maney 


Therefore f is an initial segment of fo or vice versa. Since {9 satisfies (*) by 
induction hypothesis, neither of these can be a proper initial segment of the 
other. Thus tg = tg. Cancelling tf) on both sides of (3) we obtain 


(4) Se OO ee earn eer 
Repeatedly applying the argument leading from (3) to (4) we finally obtain 
O=¢. 


This contradicts (1). Therefore ¢’ cannot be a proper initial segment of t. 
The proof that f cannot be a proper initial segment of fis analogous. LU 


Applying 4.2 one can obtain in a similar way 


4.3 Lemma. (a) [fty....,f,-1 and to,..., ty— 1 are terms, and ifty...t,-1 = 
fo...0,-4,thenn = mand t; = t; fori <n. 
(b) If’ @o,-++5Pn—1 Nd Po,--+5Pm—1 are formulas, and if Og---QMn-1 = 


Po... Om-1, thenn = mand @; = @; for i <n. 
Using 4.2 and 4.3 one can then easily prove 


4.4 Theorem. (a) Every term is either a uariable or a constant or a term of the 
form fto...t,—,. In the last case the function symbol f and the terms 
to,-++5l,—4 are uniquely determined. 

(b) Every formula is of the form(1) tg = ty, or (2) Rtp...ty—4, or (3) “19, or 
(4) (9 ~ Ww), or (5) (9 v W), or (6) (gy > W), or (7) (p > H), or (8) Vx@, or 
(9) dx@, where the cases (1)-(9) are mutually exclusive and the following 
are uniquely determined: the terms tg and t, incase (1), the relation symbol 
Rand the terms to,-.+5t,—, in case (2), the formula @ in (3), the formulas 
~ and w in (4), (5), (6), and (7), and the variable x and the formula ¢ in (8) 
and (9). L 


Theorem 4.4 asserts that a term or a formula has a unique decomposition 
into its constituents. Thus, as we shall now show, we can give inductive 
definitions on terms or formulas. For example, to define a function for all 
terms it will be sufficient 


(T1)" to assign a value to each variable; 

(T2)” to assign a value to each constant; 

(T3)” for every n-ary f and for all terms to,..., f,—-, to assign a value to the 
term ftp... f,-1 assuming that values have already been assigned to 
the terms tg,...,¢ 


>°n-1: 


Each term is assigned exactly one value by (Tt)” through (T3)”, We show this 
by means of induction on terms as follows. 


22 I]. Syntax of First-Order Languages 


t = x: By 4.4(a) t is not a constant and does not begin with a function 
symbol. Therefore, it is assigned a value only by an application of (T1)”. 
Thus t is assigned exactly one value. 


t = c: The argument is analogous to the preceding case. 


t = fto...t,-,;, and each of the terms t,,...,t,-; has been assigned 
exactly one value: To assign a value to t we can only use (T3)’, by 4.4(a). 
Since, again by 4.4(a), the t; are uniquely determined, t is assigned a unique 
value. 


We now give some examples of inductive definitions. 
The function var (more precisely, vars), which associates with each 
S-term the set of variables occurring in it, can be defined as follows: 


4.5 Definition. 
var(x) = {x}; 
var(c) = @; 
var( ftp... t,—-1) = Var(fp) U... u var(t,— 4). 


The function SF, which assigns to each formula the set of its subformulas, 
can be defined by induction on formulas as follows: 


SF(fg = ty) = {t, = t}; 
SF(Rtg ... fy--1) = {Rtg eee tna 
SF(A@):= {70} u SF(g): 


SF((@ A #)) = {(o A W)} VU SF(@) VU SF);. 
similarly for (g v $), (g > §), and (gy @ $); 


SF(Vx@) = {Vx} u SF(@); 
similarly for 4x@. 


A means of defining the preceding notions by calculi is indicated in the 
following exercise. 
4.6 Exercise. (a) Let the calculus C, consist of the following rules: 


y ft; 
xX x? Ve tigecsi 


iff € Sis n-ary andi <n. 


Show that, for all variables x and all S-terms t, xt is derivable in C, iff x € var(t). 
(b) Give a result for SF analogous to the result for var in (a). 


4.7 Exercise. Alter the calculus of formulas by omitting the parentheses in 
3.2(F 4), e.g., by writing simply “g a y” instead of “(g@ a W)”. So, for example, 
YX := 3vp Pvg A Qv, is a {P,Q}-formula in this new sense. Show that the 
analogue of 4.4 no longer holds, and that the corresponding definition of 
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SF(y) can yield both SF(y) = (z, Pvp A Qv,, Pv), Qv,} and SF(y) = 
(¥, dvg Pv9, Pvp, Qv,}, so that SF is no longer a well-defined function. 


4.8 Exercise (Parenthesis-Free, or So-Called Polish Notation for Formulas). 
Let S be a symbol set and let A' be the set of symbols given in 2.1(a)—(d). Let 
Ay = A' US. Define S-formulas in Polish notation (S — P-formulas) to be 
all strings over Ag which can be obtained by finitely many applications of 
the rules (F1), (F2), (F3), and (F5) from 3.2, and the rule (F4)’: 


(F4) If @ an W are S — P-formulas then a gy, v pi), sow, and oy are 
also S — P-formulas. 


Prove the analogues of 4.3(b) and 4.4(b) for S — P-formulas. 


§5. Free Variables and Sentences 
Let x, y, and z be distinct variables. In the {R}-formula 


p :=dx(Ryz a Vy(Ty = & v Ryz)) 


the occurrences of the variables y and z marked with single underlining are 
not quantified, 1.e., not in the scope of a corresponding quantifier. Such 
occurrences are called free, and as we shall see later, the variables there act 
as parameters. The occurrences of the variables x and y marked with double 
underlining shall be called bound occurrences. (Thus y has both free and 
bound occurrences in @.) 

We give a definition by induction on formulas of the set of free variables 
in aformula @; we denote this set by free(@). Again we fix a symbol set S. 


5.1 Definition. 
free(ty = t,) := var(t)) u var(t,); 
free(Pty ...t,-1)#= var(to) U-++ U var(t,— 1); 
free(1¢) := free(¢); 
free((g@ * w)):= free(@) u free($) for * = A, V, -+,0; 
free(Vx@) := free(p) — {x}; 
free(4x@) := free{@) — {x). 


The reader should use this definition to determine the set of free variables 
in the formula @ at the beginning of this section (S = {R)). We do this here 
for a simpler example. Again let x, y, and z be distinct variables. 

free((Ryx — Vy Ty = z)) = free(Ryx) u free(Vy Ty = z) 
= {x,y} u (free(ny = z) — {y}) 
= ix, y} UCy, 2} — {y}) = {x y, 2}. 
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Formulas without free variables ("'parameter-free'' formulas) are called 
sentences. For example, Jtp vp) = Up is a sentence. 

Finally, we denote by L* the set of S-formulas in which the variables 
occurring free are among Up,..., Py—1: 


L* := {g|@ is an S-formula and free (p) < {v9,..-, U,—1}}: 


In particular L3 is the set of L-sentences. 


5.2 Exercise. Show that the following calculus C,,; permits to derive precisely 
those strings of the form x@ for which g € L* and for which x does not occur 
free in @. 


if f9,t, € T° and x ¢ var(t)) u var(t,); 
, ifReS is n-ary, to,...,t,-,¢ 7°, and 
x € var(to) u... U var(t,_1)5 


x @ 
xX Q. x W 
x79 x(p*y) 


X Rtg... ty-1 


for* = A, V,7,°; 


xVxp’ = x Ax@’ 


xX @ 
x Vy’ 


ifx zy; a ifx # y. 


CHAPTER III 
Semantics of First-Order Languages 


Let R be a binary relation symbol. The {R}-formula 

(1) Yvo Rvo Vo 

is, at present, merely a string of symbols to which no meaning is attached. 

The situation changes if we specify a domain for the variable vg and if we 

interpret the binary relation symbol R as a binary relation over this domain. 

There are, of course, many possible choices for such a domain and relation. 
For example, suppose we choose N for the domain, take “Vvg” to mean 


"for all neN” and interpret R as the divisibility relation R‘ on N. Then 
clearly (1) becomes the (true) statement 


jor allne N, RNnn, 
Le., the statement 
every natural number is divisible by itself. 
We say that the formula Yvg Rugt, holds in (N, R%). But if we choose the 


set Z of integers as the domain and interpret R as the “smaller-than" relation 
R* on Z, then (1) becomes the (false) statement 


jor alla € @, R@aa, 
1.e., the statement 


for every integer a,a <a. 


We say that the formula Vvg Rvp v9 does not hold in (Z, R*). If we consider 
the formula 


dro(Rv,v A Rug v2) 
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in (Z, R“), we must also interpret the free variables v, and v, as elements of 
Z. If we interpret t,; as 5 and v2 as 8 we obtain the (true) statement 


there is an integer a such that 5 < a anda < 8. 
If we interpret vr, as 5 and v, as 6 we get the (false) statement 
there is an integer a such that 5 < aanda < 6. 


The central aim of this chapter is to give a rigorous formulation of the 
notion of interpretation and to define precisely when an interpretation yields 
a true (or false) statement. We shall then be able to define in an exact way 
the consequence relation, which was mentioned in Chapter I. 

The definitions of "term”, "formula”, "free occurrence”, etc., given in 
Chapter II, involve only formal (1.e., grammatical) properties of symbol 
strings. We call these concepts syntactic. On the other hand the concepts 
which we shall introduce in this chapter will depend on the meanings of 
symbol strings also (for example, on the meaning in structures, as in the 
case above). Such concepts are called semantic concepts. 


| Structures and Interpretations 


If A is a set and n > 1, an n-ary function on A is a map whose domain of 
definition is the set A" of n-tuples of elements from A, and whose values lie 
in A. By an n-ary relation &{ on A we mean a subset of A’. Instead of writing 
(a,, .-.54,—,) € &, we shall often write Qay...a,—,, and we shall say that 
the relation & holds for a,, ..., a,— 1. 

According to this definition the relation "smaller-than" on Z is the set 
{(a, b)|a, be Zanda < b). 

In the examples given earlier, the structures (N, R“) and (Z, R“) were 
determined by the domains N and Z and by the binary relations R“ and 
R“ as interpretations of the symbol R. We call (N, R‘) and (Z, R7) {R}- 
structures, thereby specifying the set of interpreted symbols, in this case {R). 

Consider once more the symbol set S,, = {s,e) of group theory. If we 
take the real numbers R as the domain and interpret ¢ as addition over R 
and e as the element 0 of R, then we obtain the S,,-structure (R, +, 0). In 
general an S-structure YW! is determined by specifying: 


(a) adomain A, 

(b) (1) an n-ary relation on A for every n-ary relation symbol in S, 
(2) an n-ary function on A for every n-ary function symbol in S, 
(3) an element of A for every constant in S. 


We combine the separate parts of (b) and give: 


1.1 Definition. An S-structure is a pair Wl = (A, a) with the following prop- 
erties: 
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(a) Ais a nonempty set, the domain or universe of I. 

(b) a is a map defined on S satisfying: 
(1) for every n-ary relation symbol R in S, a(R) is an n-ary relation on A, 
(2) for every n-ary function symbol fin S, a( f)1is an n-ary function on A, 
(3) for every constant c in S, a(c) is an element of A. 


Instead of a(R), a(f), and a(c), we shall frequently write Ru, f™, and c™, 
or simply R“, f4, and c*. For structures Qf, 8, ... we shall use A, B,... to 
denote their domains. Instead of writing an S-structure in the form Yf = (A, a), 
we shall often replace a by a list of its values. For example, we write an 
{R, f, g}-structure as W = (A, Ru, f™, g”). 

In investigations of arithmetic the symbol sets 


Sar t= {+,°, 0, 1) and Sa = feo *, 0, I, <} 


play a special réle, where + and . are binary function symbols, 0 and | are 
constants, and < isa binary relation symbol. Henceforth we shall use Yt to 
denote the S,,-structure (N, +‘, .%, O%, 1%), where +% and -% are the usual 
addition and multiplication on N and 0“ and 1% are the numbers zero and 
one, respectively. 91' :-=(N, +%,.‘, 0%, 1%, <%), where <“ denotes the 
usual ordering on N, is an example of an S,--structure. Similarly we set 
R:=(R, += FORT and R<:=(R, + ,.%, 08, 1%, <™). We shall often 
omit the superscripts *," from +%, +%,...,1%, 1 It will, however, be clear 
from the context whether, for example, + is intended to denote the function 
symbol, the addition on N, or the addition on R. 
The interpretation of variables is given by a so-called assignment. 


1.2 Definition. An assignment in an S-structure YW isa map Pp: {v, |nEeEN}—> A 
of the set of variables into the domain A. 


Now we can give a precise definition of the notion of interpretation: 


1.3 Definition. An S-interpretation 3 is a pair (YI, 8) consisting of an S- 
structure 2 and an assignment f in QI. 


When the particular symbol set S in question is either clear or unim- 
portant we shall speak simply of structures and interpretations instead of 
S-structures and S-interpretations. 


: : ‘ ; ; a 
If f is an assignment in W, ae A, and x isa variable, then let f — be the 
x 


assignment in Y{ which maps x to a and agrees with f on all variables distinct 
from x: 


if 
peor fe ify # x, 


a ify =x. 


If 3 = (W, f) let 3“ = (2 p¢) 
xX x 
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In the introduction to this chapter we gave some examples showing how 
an S-formula can be read in everyday language once an S-interpretation has 


been given. It is useful to practice reading formulas under interpretations. 
< 


For example, if S = 85, and the interpretation 3 = (YW, f) is given by 
(*) W— (N, +,:°,0,1, <) and B(v,) = 2n forn > 0, 


then the formula v,.(v, tv.) =v, (actually.v, + vyv2 = v4) reads 
“4-(2 + 4) = 8”, and the formula Wg Jv,t) < v, (actually Vug dvy < vgb,) 
reads "for every natural number there is a larger natural number". 


1.4 Exercise. Let 3 be the interpretation defined above in (*). How do the 
following formulas read with this interpretation? 


Avo Vo + to = V5; = ‘Do = V,; Av Vo = P13 
Vig dbp = 045 Vv9 Vu, dv3(v9 < Vz A V2 < V;). 


1.5 Exercise. Let A be a finite nonempty set and S a finite symbol set. Show 
that there are only finitely many S-structures with A as domain. 


1.6 Exercise. For S-structures 2f = (A,a) and 23 = (B,b), let W x 23 be 
the S-structure with domain A x B:= {(a,b)|aeé A, be B}, which is deter- 
mined by the following conditions: 


for n-ary R in S and (a,, bg),...,(a,-,,5,-,})€ A x B, 
ROG. Byes Gece hg) UR ahi cay ag anid Re Dg cs Dna Js 
for n-aryfin S and (a,, bo),..., (@,-1,B,-1)€ A x B, 
OOo Do) sacks (Quis b,-1)) = (PGi aing Oy 2G) SF  Doweaies Daa) 
for cin S, 
cUXB ie (cc), 


Show: 


(a) If the S,,-structures Wand 23 are groups, then W x 23 is also a group. 

(b) If Wl and 23 are equivalence structures, then YI x 23 is also an equivalence 
structure. 

(c) If the S,-structures Wand 23 are fields, then YI x 23 is not a field. 


§2. Standardization of Connectives 


When we define the notion of satisfaction in the next section we shall refer 
to the meaning of the connectives "not”, "and", "or”, "if-then", and "if and 
only if”. In ordinary language their meanings vary. For example, "or"' is 
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sometimes used in an inclusive sense and at other times in the exclusive sense 
of "either-or". However, for our purposes it is useful to fix a standard 
meaning: We shall always use " or" in the inclusive sense, that 1s,a compound 
proposition whose constituents are connected by "or" is true (has the truth- 
value T) iff at least one of the constituents is true; it is false (has the truth-value 
F) iff both constituents are false. For example, we specify in 3.2 that a formula 
(~ v w) is assigned the truth-value T under an interpretation 3 if and only 
if @ is assigned the truth-value T under 3 or w is assigned the truth-value T 
under <3, On account of our fixed standard meaning we therefore have that 
(@ v w)is assigned the truth-value T under 3 if and only if at least one of the 
formulas @, y is assigned T under 3. 

According to our convention the truth-value of a proposition compounded 
by "or" depends only on the truth-values of its constituents. Thus we can 
use a function 


vi{T, F) x {T, F) > {T, F) 


to capture the meaning of "or"; the table of values ("truth-table"') is as 
follows: 


Similarly we proceed with the connectives "and", "if-then", "if and only 
if" and "'not”, The truth-tables for the functionsA , >, <>, and & are: 


4 
yom | 


These conventions correspond to mathematical practice. 

Connectives for which the truth-values of compound propositions depend 
only on the truth-values of the constituents are called extensional. Thus we 
use the connectives "not", "and", "or", "if-then", and "if and only if" 
extensionally. In colloquial speech, however, these connectives are often 
not used extensionally. Consider, for example, "John fell ill and the doctor 
gave him a prescription", and "The doctor gave John a prescription and he 
fell ill". By contrast with the extensional case, the truth-value of these 
compound statements also depends on the temporal relation expressed by 
the order of the two components (intensional usage). 
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When we restrict ourselves to using the connectives extensionally, we 
sacrifice certain expressive possibilities of informal language. Experience 
shows, however, that this restriction is unimportant as far as the formulation 
of mathematical assertions is concerned. Furthermore, we shall see in the 
following exercise that all other extensional connectives can be defined from 
the connectives we have chosen. 


2.1 Exercise. For n 2 | let B, be the alphabet {—, A, v, —,@,), GU 
{Po.+++,DPn—1}. We define the formulas d the language d propositional 
calculus (with the propositional variables po, ..., P,—1) to be the strings % over 
B,, which can be obtained by means of the following rules: 


F % em, 
ie KT), and oP for* = A, V, 7, ©. 
Pi 1a (a * B) 
For an n-tuple s = (So,..., 5,—,) of truth-values T, F, a so-called assignment, 


let «Ls] € {T, F) be defined by induction as follows: 
pils} = 5; Gi <n); 
Tats} = “(abs }); 
(x * B)[s] = *(aLs], Bls]) for*= a. v, >, 4%. 
als} is called the truth-value of « with respect to the assignment s. 
(a) Show that for every s = (So, s,) € {T, F}?, 


((Po A Pi) V (Po A 1P,))LSI = So. 


(b) An n-ary truth-function is a mapf: {T, F)" + {T,F). Show that for 
every n = | there are exactly 2°” n-ary truth-functions. 

(c) Show that for every n-ary truth-function f there is a formula & in the 
propositional calculus in which the symbols a, -, and «> do not occur, 
such that for all s € {T, F}", f(s) = x[s]. Prove the corresponding result 
for v, >, and «+ instead of a, +, and o. 

(d) Analogously, one defines the language of propositional calculus with 
propositional variables py, p,, p2,--., and considers corresponding 
assignments s = (Sp, 8,,52,--.) of the truth-values T and F. Show that 
if at most the variables po, p,,---,Py—1 Occur in a formula 2 of this 
language, then a[s] depends only on so,..., 5,—1- 


$3. The Satisfaction Relation 


The satisfaction relation makes precise the notion of a formula being true 
under an interpretation. Again we fix a symbol set S. By "term", "formula", 
or "interpretation" we always mean "S-term", "S-formula", or “S-inter- 
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pretation". As a preliminary step we associate with every interpretation 
= (UY, B) and every term ft an element 3(t) from the domain A: 


3.1 Definition. (a) For a variable x let 3(x):= B(x). 
(b) For a constant c €S, let 3(c) := cu. 
(c) For an n-ary function symbol f€S and terms tg,...,f,-4 let 


S(fto .-- tr—1) = f™(B(to), 5, Mtn 1): 

As an illustration, if S = S,, and 3 = (Ro, B) with Ro = (R, +, 0) and 
B(vo) = 2 and P(v.)=6, then Bvge(e°cr,)) = Blvg) + Weov,) =2 + 
(0+6)=8. 

Now, using induction on formulas @, we give a definition of the relation 
3 is a model of ~, where 3 is an arbitrary interpretation. If 3 is a model of g 
we also say that 3 satisfies ~ or that @ holds in 3, and we write 3 F @. 


3.2 Definition of the Satisfaction Relation. For all 3 = QI, B) we let 
SF lo Sty iff S(to) = S(t), 
Ze Rey eet, iE R'3(to).. Bty_1) (ie, R® holds for 
Dig cus ea): 
Se To iff not 3K @, 
SE (po A) iff 3— mand 3 yf, 
= (p v W) iff J gorIe y, 
3 = (gw) iff if 3 gp then? & y, 
JE (gow) iff 3 — wif and only if3& y, 


3 Vx@ - ifffor allae A Se, 


3 & dx iff there is an ae A such that 3- F @. 


(For the definition of 3 “ see §1.) 
x 


Given a set ® of S-formulas, we say that 3 is a model of ® and write 
JS Oif3t @ holds for all pe ®. 

By going through the individual steps of definition 3.2 the reader should 
convince himself that 3 — @ if w becomes a true statement under the inter- 
pretation 3. The steps in 3.2 involving quantifiers are illustrated by the 
following example. Again let S = S,, and 3 = (Mo, B) with Ry = (R, +, 0) 
and f(x) = 9 for all x. Then we have 


J Vu v9 °€ = % iff for allreR, 3 i a) 


iff forallre R, re 
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3.3 Exercise. Let P be a unary relation symbol and f be a binary function 
symbol. For each of the formulas Vu, ftgv, = v9, dvg Vv, fugv, = vy, and 
dup(Pvg A Vu, Pfvgv,) find an interpretation which satisfies the formula 
and one which does not satisfy it. 


3.4 Exercise. A formula which does not contain “4, «, or — is called 
positive. Show that for every positive S-formula there is an S-interpretation 
which satisfies it. (Hint: One can, for example, use a one-element domain.) 


$4. The Consequence Relation 


Using the notion of satisfaction we can state exactly when a formula is a 
consequence of a set of formulas. Again we assume a symbol set S is given. 


4.1 Definition of the Consequence Relation. We say that @ is a consequence 
of ® (written: ® — @) iff every interpretation which is a model of ® is also a 
model of ¢.' 


Instead of “{yv} E @” we shall write “y E @”. We have already sketched 
some examples of the consequence relation in Chapter I. Now we can 
formulate I.1.1. (existence of a left inverse) as follows: 


O,, F Vlg dvyv, ° Up =e, 
where 
®,, = {Vu Voy Vv2 09 ° (Uy ° U2) = (V9 2 04) © UD, 
Vtg tg ° e€ = Ug, Wg JU, ov, = e}. 
To show that a formula ¢ is not a consequence of a set of formulas 9, it 
is sufficient to give an interpretation which satisfies every formula in ® but 


fails to satisfy ¢. 
For example, one shows 


(1) not ®,, & Vg Vy09 o Vy = V1 ° Uo 


by giving as an interpretation a nonabelian group © with an arbitrary 
assignment of variables to elements of 8.Analogously, one can use an abelian 
group to show 


(2) not OD. H 1VU9 Viylo o Vy = Vy ° Ug. 


1 We use the symbol + for both the satisfaction relation (3 @) and for the consequence 
relation (Q'= @). The symbol preceding “|= ” (either for an interpretation, such as 3,or fora 
set of formulas, such as Q@)determines the meaning. 
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With (1) and (2) we see that 

not DE @ 
does not necessarily imply 

OE 19. 


In Chapter I it became clear— both by examples and in an informal way — 
that when ¢can be proved from a system of axioms @ then ¢ is a consequence 
of ®, There we raised the question as to what extent the consequences of a 
system of axioms can be obtained by mathematical proofs. The precise 
definitions of concepts given in this chapter and the next lay the foundation 
for a rigorous discussion of this question. In Chapter V we shall obtain the 
fundamental result that the consequence relation ®— @ can always be 
established by means of a mathematical proof. We shall see that such a proof 
consists of very elementary steps which, moreover, can be described in a 
purely formal way (that is, syntactically). 

Using the notion of consequence we are now able to define the notions of 
validity, satisjiability, and logical equivalence. 


4.2 Definition. A formula ¢ is valid (written: — ¢) iff O & @. 


Thus a formula is valid when it holds under all interpretations. For 
example, all formulas of the form (g v 1) or 3x x = X are valid. 


4.3 Definition. A formula ¢ is satisfiable (written: Sat ~) iff there is an inter- 
pretation which is a model of g. A set of formulas @ is satisfiable (written: 
Sat ®) iff there is an interpretation which is a model of all the formulas in ®. 
4.4 Lemma. For all ® and all 9, 
OME ¢— iff not Sat® vu {9g}. 

In particular, @ is valid iff ¢@ is not satisfiable. 
PROOF. D E ¢— 

iff every interpretation which is a model of ® is also a model of ¢, 

iff there is no interpretation which is a model of ® but not a model of ¢, 

iff there is no interpretation which is a model of ® u {7@}, 

iff not Sat ® u {7@}. LJ 


4.5 Definition. Two formulas @ and w are logically equivalent iff ep — w and 
we @. 


Thus two formulas g and w are logically equivalent iff they are valid 
under the same interpretations, that is, iff F pe JD. 
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It is immediately evident, from the definition of the notion of satisfaction 
together with the truth-tables for connectives that the following formulas are 
logically equivalent: 


ory and a(4@ v 7), 
o-w and Nev y, 
peop and fev wb)v WA 1@ v Ty). 


(+) 


Vxp and 4x 7 @. 


Therefore, we can dispense with the connectives A, —, and «+, and the 
quantifier V. More precisely, we define a map . by induction on formulas, 
which associates with every formula g a formula ~* such that ~* is logically 
equivalent to g and does not contain a, >, «, or V: 


g*:=@_ if @ is atomic, 


(Te) = 19%, 

(pv $)* =@* v $, 

(p a $)* = 179% v 1y*), 

(p > $)* = neg* v H, 

(po W)* = T1(g* v W*) vV 179% v TY"), 
(Axp)* = Ix@*, 
(Vx@)* = 44x 79%. 


Using (+) one can easily prove that * has the desired properties. 

In general a formula ¢ is easier to'read than the corresponding formula 
*, as is clear from (+ ). But because of the logical equivalence of g and @* 
we do not lose expressive power when we exclude the symbols a, >, «>, and 
¥ from our first-order languages. This will simplify our investigations of the 
languages; in particular, proofs by induction on formulas will be shorter. 
Thus we make the following conventions: 


(1) In the sequel we restrict ourselves to formulas in which a, >, 0, and ¥ 
do not occur. That is, in the common alphabet A of the first-order 
languages (cf. II.2.1.) we omit the symbols a.-—, <<, and V. In the defini- 
tion I1.3.2. of formulas we restrict cases (F4) and (F5) to the introduction 
of formulas of the form (g v yw) and 4xq, respectively. Finally, in the 
definition of the notion of satisfaction we eliminate the cases correspond- 
ing to a, +, ttand VY. 


(2 


—S 


Nevertheless we shall sometimes retain the symbols a, >, «, ¥ when 
writing formulas. Such "formulas ¢~ in the old style" should now be 
understood as abbreviations for ~*; for example, Vx(Px A Qx) should 
be understood as an abbreviation for 74x 17(7Px v 7Qx). 


§5. Coincidence Lemma and Isomorphism Lemma 


4.6 Exercise. For arbitrary formulas @, yw, and 7 show that 


(a) (ov We ziffge zand pF x. 
(b) F(g > W)iff@ ke y. 


4.7 Exercise. (a) Show that 4x Vvg & Vy 4x@. 
(b) Show that W 4x Rxy — 4x Vy Rxy does not hold. 


4.8 Exercise. Prove: 


(a) — Y¥x{p a WO Vx@ a Vx, 

(b) = dx(g v W)odxe v Ax, 

(c) FE Vx(9 v Weg v Vx, if x ¢ free(¢), 

(d) = Ix(g a WP) og a Axy, if x ¢ free(¢). 

(e) Show that one cannot do without the assumption "x ¢ free(@g)” in (c) 
and (d). 


4.9 Exercise. Let g@ and $be formulas such that — (g@ © $). Let 7’ be obtained 
from the formula y by replacing all subformulas of the form ¢ by w. 


(a) Define the map’ by induction on formulas. 
(b) Show that for all 7, E(y 7’). 


Historical Note. The precise development of semantics is due essentially 
to A. Tarski [27]. The notion of logical consequence was already present 
in work of B. Bolzano [3]. 


§5. Coincidence Lemma and Isomorphism Lemma 


It seems intuitively clear that the satisfaction relation between an S-formula 
q@ and an S-interpretation 3 depends only on the interpretation of the 
symbols of S occurring in g, and on the variables occurring free in ~. The 
following lemma gives an exact formulation. 


5.1 Coincidence Lemma. Let 3, = (ill,,8,) be an S,-interpretation and 


3. = (Y,, 8.) be an S,-interpretation, both with the same domain, 1.e., 
A, = A,. Put S =S, 7 Ss: 


(a) Let t be an S-term. /f 3, and 3, agree* on the S-symbols occurring in t 
and on the variables occurring in t, then 3,(t) = 3,(¢). 

(b) Let @ be an S-formula. If 3, and 3, agree on the S-symbols und on the 
vuriables occurring free in g, then 3, — g iff 3, —& Q. 


25, and 3, agree on k € § Or on x if k™! = k*? Or B,(x) = B(x), respectively. 
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PROOF. (a) We use induction on terms. 
t = x: By hypothesis, f,(x) = B,(x) and therefore 
3,(x) = Bi(x) = B2(x) = 32(%). 
t = c; Similarly. 
= fteicat, ay.) GS neary and tossecuteg el”): 
Si(fto... tras) = (Sito)... Stn 1) 
= £%(53,(to),..., 35(t,-1)) (by induction hypothesis) 
= £°°(3,(to),--+» Ta(tn-1)) (by hypothesis f™ = f ™) 
= 3(fto--. — 
(b) We use induction on S-formulas. 
p = Rty...ty-, (RES n-ary and ty,...,f,-,€ T°): 
3, Rty.-- ty, iff R™Sy(to) -- Ti(ta—1) 
iff R® Iyltg) 3 So(tn-1) (by (a)) 
iff R°°3,(ty)... 3,(t,-;) (by hypothesis R™' = R*?) 
iff 3, Rtg... t,—1. 
@ =f, = t,: Similarly. 
P= TW: 


3,— ay iff not 3,E YW 
iff not 3, — wy (by induction hypothesis) 
iff 3, —& Ty. 
= (w v x): Similarly. 
ge = 1xw: 


~ : : a 
J, dxy iffthere isan ae A, such that 3, an W 


iff there is ana € A,(=A,) such that 3, = E w 
Xx 


(by the induction hypothesis applied to yw, 3, and ay 4. note that, because 
x x 


free(w) < free(y~) u {x), the interpretations 3, = and 32 = agree on all 
symbols occurring in y and all variables occurring free in y) 


iff 3, Ixy. C] 


In particular, the coincidence lemma says that, for an S-formula @ and 
an S-interpretation 3 = (QM, f), the validity of @ under 3 depends only on 
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the assignments for the finitely many variables occurring free in ¢ (and, of 
course, on the interpretation of the symbols of S in Y). If these variables are 
AMONG Ug,+-+5U_—z: Le. if gE Le, it is at most the B-values a; = B(v,) for 
i=0,...,n — 1 which are significant. Thus, instead of (QI, 8) — @, we shall 
often use the more suggestive notation 


WE plag,-...,an-1]- 


(Similarly, for an S-term f such that var(t) c {vg,...,U,-,} we shall write 
t"[a,, ...,Q,—1] instead of 3(t).) When @¢ is a sentence, i.e., @ € L$, we can 
choose n = 0 and write 


WE gy, 


without even mentioning an assignment. In that case we say that YW is a model 
of gp. For a set of sentences ®, Yl = ® means that WE @ for every gE O. 


5.2 Definition. Let S and S' be symbol sets such that S c 8’; let YW = (A, a) 
be an S-structure, and YI’ = (A', a') be an S-structure. We call Yl a reduct of 
QI’ (more precisely, the S-reduct of 2’) iff A = A' and a and a' agree on S. In 
this case QW’ is called an expansion of W, and we write W = W' PS. 


For example, the $;--structure R * (the ordered field of real numbers) is an 
expansion of the S,,-structure R (the field of real numbers): & = R* ['S,,. 

If QW = W’ } S, then it follows from the coincidence lemma that for g € LS 
and a,, ...,@,-,€A, We @lao,...,4,-1] iff WE —la,...,a,-;]. To 
see that this holds we choose f: {v,: ne N} ~ A so that B(v,) = a, fori < n, 
and we apply the coincidence lemma for 3, = (QI, f) and 3, = (W’, f); 
3, and 3, agree on the symbols occurring in g and on the variables occurring 
free in @. 

The definitions of interpretation, consequence, and satisfiability refer to a 
fixed symbol set S. Using the coincidence lemma we can remove this reference 
to S. Let us consider, for example, the notion of satisfiability. If ® is a set of 
S-formulas and S'=> S, then @ is also a set of S’-formulas. As a set of S- 
formulas, ® is satisfiable if there is an S-interpretation which satisfies it, 
and as a set of S’-formulas it is satisfiable if there is an S’-interpretation which 
satisfies it. We have 


5.3 Lemma. ® is satisfiable with respect to S iff ® is satisfiable with respect 
to S'. 


Proor. If 3’ = (W, f’) is an S’-interpretation such that 3'— @®, then by 5.1 
the S-interpretation (2’ [ S, 6’) is a model of 0. On the other hand, if 
3 = (QI, f#) 1s an S-interpretation which satisfies @, we choose an S’-structure 
QW’ such that QW’ -S = W. (The symbols in S' — S can be interpreted arbi- 
trarily.) By 5.1 the S-interpretation (W’, £) is then a model of ©. LI 
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We conclude this section with a result about isomorphic structures. 


5.4 Definition. Let Ql and 8 be S-structures. 


(a) A map z:A—B is called an isomorphism of YW onto B (written: 
nm: U = 8 ) iff 


(1) 1 is a bijection of A onto B; 
(2) for n-ary RE S anda,, ...,a,-,;€A, 


R2ag...G,—-1 iff R®n(ag)... 2(a,_,); 
(3) for n-ary f’e S and @,...,a,-1 € A, 
Rf "(dos +++ Qn—1)) = f(g), -. +, WGn—1))5 


(4) for every cE S, a(c¥) = c"’ 


(b) 2 and B are said to be isomorphic (written: Yl = 8 ) iff there is an iso- 
morphism 7: % = 8. 


For example, the S,,-structure (N, +, 0) is isomorphic to the S,,-structure 
(G, +%, 0) consisting of the even natural numbers with ordinary addition 
+ %, The map 71:N > G such that x(n) = 2n is an isomorphism of (N, +, 0) 
onto (G, +%, 0). 

The following lemma shows that isomorphic structures cannot be 
distinguished by means of first-order sentences. 


5.5 Isomorphism Lemma. /f 2{ and 8 are isomorphic S-structures, then for 
all S-sentences 


Weo iff BE ¢. 


ProoF. Let 7: Y= 8. For the intended proof by induction it is convenient 
to show not only that the same S-sentences hold in YF and 8, but also 
that the same S-formulas hold if one uses corresponding assignments: 
With every assignment f in 2 we associate the assignment f*:= 70 B in 8 , 
and for the corresponding interpretations 3 = (Y, f#) and 3":=(8, f”) we 
shall show: 


(i) For every S-term t, 1(3(t)) = 3*(1). 
(ii) For every S-formula 9, 3 — @ iff 3" & ¢. 
This will complete the proof. 
(i) can easily be proved by induction on terms. (ii) is proved by induction 


on formulas g simultaneously for all assignments f in YW. We only treat the 
case of atomic formulas and the steps involving 7 and 3. 
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P= fo = t,: 
3et ef, iff (to) = 3(t,) 
iff n(B(to)) = a(B(t,) (since m7: A > B is injective) 
iff 3"(fo) = I(t) (by (i)) 
i le ae Fe 
Pp = Rigiwivdgeq: 
SE Rty...t,-1 iff R™3(to)--- Btn-y 
iff R®x(3(to))... m(B(t,-1)) (because 7: WU = B) 
iff RP3"(to)... 3*(ty-1) (by (1)) 
iff 3" = Rty...t,-1- 


p= 1: 
3 14 £xiffnotz3ey 
iff not3"— yw (by induction hypothesis) 
iff 3" = Ty. 
o = Ix: 


3 Sxy_ iff there is an a€ A such that 3-& 


a Ha 
iff there is an a € A such that (3 :| Ew 
(by induction hypothesis) 


iff there is an ae€ A such that 3" 


y 


(since (3 4) aa iy ~) 
x x 


b 
iff there is an element b € B such that 3" = E= wy 


na) 
x 


(since nm: A — B is surjective) 


iff 3" Ixy. O 
From the proof we infer 


5.6 Corollary. If x: Ul = 8B, then for g € L> and ay,...,a,-1 € A, 
WE glao....,a,-1] iff BE elalag),..., MGy-,)]. O 


5.5 tells us that isomorphic S-structures cannot be distinguished in L$. 
Conversely one could ask whether S-structures in which the same S-sentences 
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are satisfied are isomorphic. In Chapter VI we shall see that this is not always 
the case. For example, there are structures not isomorphic to the S,,-structure 
Yt of natural numbers in which the same first-order sentences hold. 


5.7 Exercise. Prove the analogue of 5.3 for the consequence relation. 


5.8 Exercise. Let S be a finite symbol set and let Y& be a finite S-structure. 
Show that there is an S-sentence @y the models of which are precisely the 
S-structures isomorphic to WW. 


5.9 Exercise. Show: 


(a) The relation < (“less-than”) is elementarily definable in (R, +, -, 0), Le., 
there is a formula ge Li*:*® such that for all a, be R, (R, +,°,0) & 
ela, b] iffa <b. 

(b) The relation < is not elementarily definable in (R, +, 0). (Hint: Work 
with a suitable automorphism of (R, +, 0), 1.e., a suitable isomorphism of 
(R, +, 0) onto itself.) 


5.10 Exercise. Let 1 be a nonempty set. For every i€1, let YW; be an S- 
structure. We write [lie ; 2; for the direct product of the structures Y,, 
ieél, that is, the S-structure Yl with domain Ulie A; :={glg:I- ier A;, 
and for all i €1, g{i) € A,), which is determined by the following conditions 
(for g € | ],;-, A; we also write <g(i)|i €1)): 


For n-ary R in S and for go,---5Gn—1 € | lier Ai: 
R7go---Gn—1 iff for allie LR go(i)... ga— 1); 
for n-ary fin S and gy, .--;G,-1 € [lier A,, 
IGos +++ Gn—1) = SS MGolis «5 nM ieD; 
for cin S, 
= de ieD. 


(a) Show that if t is an S-term such that var(t}) ¢ {vg,...,0,-,} and if 
Jos-.+9Gn-1 Ellie A,, then 


OP lgiscs Geel =<" 9s) e225 GO eb. 


Formulas which are derivable in the following calculus are called Horn 
formulas. 


(i) __, if @ is atomic; 
gif @ is atomic 


(11) ie if @ is atomic; 


(iii) ,if @o,..., @, are atomic andn > 1; 
(Po A+++ A Qn-1) > Pn 
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(iv) 


Q WW. 
(paw) 


? . 
(v) ee 


oe 
(v1) aaa 


Horn sentences are Horn formulas containing no free variables. 

(b) Show that if g is a Horn sentence and if every YW; is a model of ~, then 
[lier Ut; = y. (Hint: State and prove the corresponding result for Horn 
formulas. ) 


5.11 Exercise. A set ® of sentences is called independent if there is no gE ® 
such that ® — {g} — @. Show that the axioms for groups (cf. p. 32) and the 
axioms for equivalence relations (cf. p. 16) are independent sets of sentences. 
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As we saw in $4, the axioms for group theory can be formulated, or as we 
often say, formalized, in a first-order language. As another example of 
formalization we give the cancellation law for group theory: 


= Vin Vir Viste s Pe SS Ph Os SS Oa = By), 
o VU, VU2Wo ° U2 1°02 ) 1 


To say that the cancellation law holds in a group © means that © & g, and 
'to say that it holds in all groups means that ®,, F ¢. 
The statement "there is no element of order two" can be formalized as 


Wi= Tdvp(Tg Ze A Ug 2 Up =). 


The observation that there is no element of order two in (Z, +, 0) thus 
means that (Z, +, 0) is a model of w. 

For applications of our results it is helpful to have a certain proficiency 
in formalization. The following examples should serve this purpose. As the 
exact choice of variables is unimportant (for example, instead of using the 
formula @~ above we could have used 


V0.7 Vog V0y(0, 7° 0, = Ugo Vy > V17 = Vg) 


to formalize the cancellation law) we shall denote the variables simply by 
X, y, Z,..., where distinct letters stand for distinct variables. 
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6.1 Equivalence Relations. The three defining properties of an equivalence 
relation can be formalized with the aid of a single binary relation symbol as 
follows: 


Vx Rxx, Vx Vy(Rxv > Ryx), Wx Vy V2((Rxy a Ryz)— Rxz). 
The theorem mentioned in Chapter I 


If x and y are both equivalent to a third element then they are 
equivalent to the same elements, 


can be reformulated as 


For all x and y, if there is an element u such that x is equivalent 
to wand y is equivalent to u, then for all z, xisequivalent to ziffy 
is equivalent to z, 


and then formalized as 

Vx Vy(su(Rxu a Ryu) > Vz(Rxz@ Ryz)). 
6.2 Continuity. Let p be a unary function on Rand let A be the binary distance 
function on R, that is, A(@*g, 7,) = |rg — 7, | for ro, r, € I. Using the function 


symbols f (for p) and d (for A) we can treat (R, +,-,0, 1, <,p, A) as an 
S; vu {f, d}-structure. The continuity of p on R can be stated as follows: 


( For all x and for all £ > 0 there is a 6 > 0 such that for all y, if 
A(x, y) < 6 then A(p(x), p(y)) < &. 


Concerning the restricted quantifiers "for all e > 0” and "there is a 6 > O" 
that appear in (*) it is useful to observe that a statement of the form 


for all x such that ..., we have — 


can be formalized 


and a statement of the type 
there is an x with ... such that —— 


can be formalized 


A A ). 


Thus, using the variables u and v for ¢ and 6 we can give the following 
formalization of (*): 


Vx Vu(O < 1 > 4v(0 < v A Vy¥(dxy < vo dfxfy < u))). 


6.3 Cardinality Statements. The sentence 


@.5%= dun 30, vn = 
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is a formalization of "there are at least two elements". More precisely, for 
all S and all S-structures 1, 


We p,, iff Acontains at least two elements. 
In a similar way, for n > 3, the sentence 
Doni Wg... Wv,- (09 = Vy A+++ A Wg SUy-y A ttt A Wy 2 = V_-4) 
states that there are at least n elements, and the sentences 
Won and = Pan A WPon+1 


say that there are fewer than n elements and exactly n elements, respectively. 
If we now put 


®,, = {Qs,|Nn = pas 


then the models of ®,, are precisely the infinite structures, that is, for all S 
and all S-structures YI, 


Wi ©, iff A contains infinitely many elements. 


6.4 The Theory of Orderings. A structure W = (A, <™) is called an ordering 
if it is a model of the following sentences: 


Vx 1X < Xx, 
Mody VX Vy Vz((x < ya V<z)7> Xx < 2), 
Vx VW(x<yvxrBzyvy<x). 


(R, <®) and (N, <') are examples of orderings. If C denotes the set of 
complex numbers and <“ is defined by 

Zy ao oy iff Z,,2,€R and z, az 
then (C, <“) is not an ordering because the third axiom in ®,,, is violated. 
If for a structure W = (A, <™) we set 


field <" = {ae Alfor somebe Ava <"borb <*a},° 


then, for (C, <“), field <© = R and (field <©, <“) is an ordering. We say 
that <* is a partially defined ordering on A if (field <*, <*) is a model of 
: ie., if (A, <™) satisfies 


Vx 1X < x, 
Pooray VX VV VZ(X KV AV <z)>X < 2), 
Vx Vy(du(x <uvu<x)a duy<vvu0< y)) 
>(X<yvxe=yvy<x)). 


3 Of course not to be confused with the notion of field as in 6.5 
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6.5 The Theory of Fields. As symbol set let us take 5,, = {+,-,0, 1). An 
S,,-structure is a field if it satisfies the following sentences: 


Vx Vy Vaxty) tzext(yt2, VWxxt0=x, 


Vx Vy Vz(x.y).z =x.(y.2), Vx x-l=x, 

Vx dyx ty =O, Wx(7x =O0->4dyx-y=1), 
. Ve VV Xa y Spor x, WX Vy X-ySy-x, 

70 = 1, 


Vx Vy Vz x-(y + z) = (x-y) + (x-2). 
Ordered fields are S,,-structures which satisfy the following sentences: 
the sentences in ®,, and ®,,4, 
Dora VX Vy Voix <yoxtz<ytz), 
Vx Vy ¥z(x <y A0 <z)>Xx-z << y-z). 
6.6 Exercise. Formalize the following statements using the symbol set of 
6.2: 


(a) Every positive real number has a positive square root. 

(b) If p is strictly monotone then p is injective. 

(c) pis uniformly continuous on R. 

(d) For all x, if p is differentiable at x then p is continuous at x. 


6.7 Exercise. Let S,, = {R}. Formalize 


(a) R is an equivalence relation with at least two equivalence classes. 
(b) R is an equivalence relation with an equivalence class containing more 
than one element. 


6.8 Exercise. Use 5.10 to show: 


(a) If, for every ie I, UW; is a group then | ];., UW; is a group. 
(b) There is no set of axioms for the theory of orderings and for the theory 
of fields which consists of Horn sentences only. 


6.9 Exercise. A set M of natural numbers is called a spectrum ff there is a 
symbol set S and an S-sentence ¢ such that 

M = {né N}{¢ has a model containing exactly n elements) 
Show: 


(a) Every finite subset of {1, 2, 3,...} is a spectrum. 
(b) For every m = 1, the set of numbers > 0 which are divisible by m is a 
spectrum. 
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(c) The set of squares > Ois a spectrum. 
(d) The set of nonprime numbers > 0 is a spectrum. 
(e) The set of prime numbers is a spectrum. 
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In the preceding section we had a number of examples showing how mathe- 
matical statements can be formalized by first-order formulas. However, the 
process of formalization is not always as simple as it was in those cases. In 
this section we discuss some of the typical difficulties which can arise. 


7.1 Partial Functions. When we defined the notion of structure we stipulated 
that function symbols be interpreted by total functions, 1.e., in the case of an 
n-ary function symbol, by a function that is defined on all n-tuples of elements 
of the domain. If, for example, in the field of real numbers, we regard division 
on R as a function then we do not have a structure in our sense (because the 
quotient is undefined if the divisor is zero). The following are possible 
solutions to this difficulty: 


(1) The division function can be extended to a total function. For example, 
one can define +/0 := 0 for all r € R and take this into consideration when 
formulating statements about the division function. 

(2) Instead of the division function, one can consider its graph, that is, the 
ternary relation {(a,b,c)¢ R*|b #0 and a/b =c). In VIII.1 we shall 
describe how statements about functions can be translated into state- 
ments about their graphs. The remarks made there for total functions 
can easily be modified to cover the case of partial functions. 

(3) One can develop semantics for first-order languages which also include 
partial functions. However, this approach leads to a complicated logical 
system without yielding anything essentially new, as we see from (1) 
and (2). 


7.2 Many-Sorted Structures. The structures we have hitherto considered 
have only one domain and in this sense consist of elements of only one sort. 
On the other hand, some important structures in mathematics contain 
elements of different sorts. Planes in affine spaces consist of points and lines, 
and vector spaces consist of vectors and scalars. Taking vector spaces as an 
example, we give two possibilities for treating many-sorted structures. 


(1) Many-Sorterl Languages. We regard a vector space % as a "Structure 
with two domains" (as a so-called two-sorted structure): 


B = (F,V, +*,-7,0%5 17, Oe", «F"), 
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where F is the set of scalars, (F, +7, -’, O*, 1") is the field of scalars, V is the 
set of vectors, (V, ®”, e’) is the additive group of vectors, and **:" is the 
multiplication of scalars and vectors defined on F x V. 

In order to describe such two-sorted structures we introduce a two-sorted 
language, that is, a language built up in the same way as the languages we 
have used so far, but having two sorts of variables, namely ug, uy, u2,..- 
(for elements of the first domain, in the case above, scalars) and tg, 1, U2,... 
(for elements of the second domain, in the case above, vectors). 

A quantified variable always ranges over the corresponding domain. To 
illustrate this we formalize some of the axioms for vector spaces. 


(a) Associativity of scalar addition: 

Vito Vuly Viz Ug + (Uy + Un) = (Ug + Uy) + UD. 
(f) Associativity of vector addition: 

Vig Vv, Vv Vg © (v, © v2) = (vg © 01) @ 0. 
(y) Associativity of scalar multiplication of vectors: 

Vulg Vu, Vo(tg- U1) * Up = Ug * (Uy * Uo). 

(2) Sort Reduction. It is also possible to use our one-sorted first-order 
languages to treat many-sorted structures, namely, by a so-called sort 
reduction. We demonstrate this method briefly for the case of vector spaces. 


Let F and K be two new unary relation symbols. We regard a vector space 
asa {F, V, ++,0, 1, ®, e, *)-structure 


B = (F u V, F®, v%, oP. he 0°, 1. Q*, e", #.); 


where F® := Fand V"':= V,and the functions +®,-¥, @"',and *® are arbitrary 
extensions of +7, -?, ®” and **'” to(Fu V) x (Fu V). The introduction 
of the "sort symbols" F and V enables us to speak of scalars and vectors. We 
exemplify this by reformulating the many-sorted vector axioms given above: 


(a) Vx Vy W2(Ex A Fy A Ez) 3 (x ty) tz=x +(y + 2)). 


(B) Vx Vy V2(Vx a Vy a Vz > (x ®y) Oz =x O(y@ 2)). 
(y) Vx Vy V2(EFx a Fy a Vz) > (x. y)*# z= x * (yp * Z)). 


Since all quantifiers are "relativized" to F or V, it makes no difference how 
the functions +*,... are extended. 


7.3 Limits of Formalizability. The question of the limits of formahzability, 
which is ultimately the question of the expressive power of first-order 
languages, will be treated in detail in Chapter VI and Chapter VII, §2. Here 
we discuss two examples. 


(1) Torsion Groups. A group © is called a torsion group if every element of 
@® has finite order, i-e., if for every a € G there is an n > | such that a" = e%, 
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An ad hoc formalization of this property would be 
Vx(x =evxox=ev (xoxox =ev...). 


However, in first-order logic we may not form infinitely long disjunctions. 
Indeed we shall later show that there is no set of first-order formulas, the 
models of which are precisely the torsion groups. 


(2) Peano’s Axioms. We consider the question whether there is a set of 
S,,-sentences the models of which are the structures isomorphic to 9t = 
(N, +,-,0, 1). For simplicity we shall start our discussion with the structure 
M, = (N, a,0), where a is the successor function on N (a(n) = n + 1 for 
ne). MM, is a {a,0)-structure, with @ ("successor") a unary function 
symbol. The results can easily be extended to it, cf. Exercise 7.5. 

Jt, satisfies the so-called Peano axiom system: 


(a) 01s not a value of the successor function a. 

(f) o is injective. 

(y) (the so-called induction axiom). For every subset X of N: if 0€ X and if 
o(n) € X whenever né€ X, then X = N. 


(a) and (f) may easily be formalized in L'%° by 


(P1) Vx nox =0; 
(P2) Wx V¥(ox = oy > x Sy). 


The induction axiom (y) is a statement about arbitrary subsets of N. For an 
"ad hoc” formalization of this axiom we would also need to quantify over 
variables for subsets of the domain. In such a language, (y) could be formal- 
ized thus: 


(P3) VX((X0 a Vx(Xx — Xoax)) > Vy Xy). 


(P3) is a so-called second-order formula (cf. IX.1). 
The following theorem shows that (P1)-(P3) characterize the structure 
Nt, up to isomorphism: 


7.4 Dedekind’s Theorem. Every structure %! = (A, o4,O*) which satisfies 
(P1)~(P3) is isomorphic to %,. 


In VI.4 we shall show that no set of first-order {g, 0)-sentences has (up to 
isomorphism) just %¢, as a model. Thus the induction axiom cannot be 
formalized in L'%:%, 

The proof of Dedekind's theorem depends essentially on the fact that in 
structures 2 which satisfy (P3), the following kind of inductive proofs can 
be given: In order to show that every element of the domain A has a certain 
property P, one verifies that 0* has the property P and that if an element a 
has the property P then o“(a) does also. 
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Suppose Y! = (A, a4, 0%) is a structure which satisfies (P1)-(P3). The 
isomorphism 2: 9t, = W, which we need, must have the following properties: 


(i) nO’) = O*; 
(ii) z(o™(n)) = a*(n(n)) for all n € N, 


that is, 


(i)' (0) = 0°; 
(ii)) (an + 1) = o4(x(n)) for all nEN. 


We define x by induction on n, taking (i)' and (ii)’ to be the definition. Then 
the compatibility conditions for an isomorphism are trivially'satisfied and 
we need only show that 7 is a bijective map from N onto A. 

Surjectivity of x: By induction in % (2 satisfies (P3)) we prove that every 
element of A lies in the image of 7. By (i)’, 0“ is in the image of x. Further, if 
a is in the image of x, say a = a(n), then o“(a) = o4(n(n)); hence by (ii), 
a‘(a) = nn + 1), and it follows that o“(a) is also in the image of 2. 

Injectivity of ~: By induction on n we prove 


(*) For all me N, if m 4 nthen z(m) # x(n). 


n=0: If m#0, say m=k +1, then a(m) = x(k + 1) = o4(x(k)), and 
since QI satisfies the axiom (P1), a4(2(k)) # O*. Hence, by (i)', 7(m) # 7(0). 

The induction step: Suppose (*) has been proved for n and suppose 
mn + 1.Ifm =0,weargueasinthecasen = Othat x(n + 1) ¥ x(m) = 0°. 
IF m #0, say m= k + 1, then k # n and so, by the induction hypothesis, 
n(k) # n(n). By the injectivity of _* (QU satisfies (P2)) it follows that a4(n(k)) 4 
a‘(n(n)); hence from (ii)'’ we have x(k +1) 4 nn t+1), Le, x(m) #4 
n(n + 1). = 


7.5 Exercise. Let I] be the following system of second-order S,,-sentences: 


Y¥x 7x + 1=0, 

Vx V(x +1ley4+15x=y), 
¥X((XO a Vx(Xx > Xx + 1)) 3 Vy Xy), 
Wx xX +0= x, 
VxVyx+(vytD=e4+y) 41, 

Vx x-0=0, 

Wx Vyx-(ytD=(-y)+x. 


(a) If W = (A, +4, 4, 04, 14) is a model of M1 and if 4: A > A is given by 
a4(a) =a +4 14, then (A, a4, 0°) satisfies the axioms (P1)-(P3). 
(b) Show that Jt = (N, +, -,0, 1) is characterized by I up to isomorphism. 
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In this section we define how to substitute a term ¢t for a variable x in a formula 
@ at the places where x occurs free, thus obtaining a formula yy. We wish to 
define the substitution in such a way that @ expresses the same about x as 
y does about ¢t. First we give an example to illustrate our objective and to 
show why a certain amount of care is necessary. 

Let 


@:=dzz+7z=.x. 
In Jt the formula ¢ says that x is even; more exactly, 


(Nt, B) = o iff BCX) is even. 


If we replace the variable x by y in g we obtain the formula Jz z + z = y, 
which states that y is even. But if we replace the variable x by z, we obtain 
the formula jz z + z = z, which no longer says that z is even; in fact, this 
formula is valid in 9t regardless of the assignment for z (because 0 + 0 = 0). 
The meaning is altered in this case because at the place where x occurred 
free, the variable z gets bound. On the other hand, we obtain a formula which 
expresses the same about z as g does about x if we proceed as follows: First 
we introduce a new bound variable u in ~, and then in the formula 
Ju u + u = x thus obtained we replace x by z. It is immaterial which variable 
u (distinct from x and z) we choose. However, for certain technical purposes 
it is useful to make a fixed choice. 

In the preceding example we replaced only one variable but in our exact 
definition we specify the procedure for simultaneously replacing several 
variables: With a given formula @, pairwise distinct variables Xg,..., %,, 


and arbitrary terms fg,...,¢,, we associate a formula 
cue 
: ere od 


which is said to be obtained from @ by simultaneous substitution of ty,...,t, 
for Xq,...,%,: The reader should note that the x; need be replaced by 1; 
only if 


x; € free(¢) and Xx; # ty. 


In the following inductive definition this is explicitly taken into account in 
the quantifier step; in the other steps it follows immediately. 

It will become apparent that it is convenient to first introduce a simul- 
taneous substitution for terms. Let $ be a fixed symbol set. 
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8.1 Definition 


(a) 


(b) 


(c) 


Pa ee ae xX, WXA Xoy0.0,% F*X,, 

Manca, fi, Ux ax, 

fe sissy 

Cc = C¢, 

eee 

Brive te ven ianente fic owit 
tr... t._,)—— = ft, ——_—_ ae 
Lito : oe a Laat Ons Seal de 


For easier reading we use square brackets here and in what follows. 


8.2 


(a) 


(b) 


(c) 


(d) 


(e) 


Definition 
fn an etile unity, in Agaaelp. -. + teteats 
[to = t,] —— = fp = ty , 
X geese ke pa ariae, 3 Ng tak, 
Eg cect fo... ty. 
[Rey ... thy] = Rtg Ot OO, 
eee Mjccnn oe par ke 
ee | be dcnbdy 
[n9]-——— = |¢ ; 
MGcioke See 
Paar 2 belicst: toiexct 
@v y)_—_ = @ vu : 
Tae, oe KGeod Xt oe 
Suppose X;,,--.5%;,_, ig <... <i,-,) are exactly the variables x; 


among the Xg,..., X, such that 


x, € free(Ax@) and xX, Ft. 


Then set 


pst f....t;  u 
[Axe] 00 = Fy g 2s 
Mastek Mic beer. 
where u is the variable x f x does not occur in t,,,..., tj, ,3 otherwise u 


is the first variable in the list vg, vy, v2,... which does not occur in ¢, 


lig» a) ti eoy 
(By introducing the variable uw we ensure that no variable occurring 
INT;,,...,6,-,; falls within the scope of a quantifier. In case there is no 


x, such that x, € free(4dxq@) and x; # t; we have s = O, and from (e) we 


obtain 
eee 2 x 
Jxp] °°" = ~ 
[Axe] Pe aslo ‘| 


which is 3x@, as we shali see in 8.4(b).) 


88, Substitution 


EXAMPLES. For binary P and f we have 


D2UVgV, 
CG) [Pug fvye,] ——— = Pro fvzvo, 
U102V3 
V4 fr, ; Jv 1V1Vo a, 
(2) [vo Pu fvyvz] aria 3to| Poo fos ————~ | = dug Pug fo, fre, 
o¥2 2 Yo 


Up Uv Vow 
(3) [Sug Pug ft,v,] ———*_ = Jey | Peo seve al = Jv3Prv,3 ftgv2 
D2 U9 b1V9 


At the places where x; occurred free in ~, we now find in 
the term t;. Hence, if free(p) < {x9,...,x,} then we expect that 


will hold for an interpretation 3 = (QI, f) iff @ holds in YW, provided we use 
the assignments <S(tg) for x9,.-., s{t,) for x,. An exact formulation of this 
property is given in the following "substitution lemma" 8.3. Later we shall 
frequently refer to this lemma whereas we shall rarely return to the technical 
details of definition 8.2.* Before stating the lemma we generalize the definition 


of ae Let Xo,..., xX, be pairwise distinct and suppose 3 = (QI, f) is an 


interpretation, and a,, ...,a, € A; then the assignment 


B Qo.--4, 
Nip ame Xp 

in Yl and the interpretation 
= dgsecdy 
Np cies 


are given by 
gee Gy (y) ify #X,...,y# x, 
_ oo a; ify = x; 


and 


adg--.a ag...@ 
got (q, p 0M) 

Na axs X, Xoo dy 
+ As the substitution lemma the further results of this section are intuitively clear. The proofs 
are straightforward but lengthy and may be skipped by a reader already familiar with proofs 
by induction on terms and formulas. 
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8.3 Substitution Lemma. (a) For every term ¢, 
ere: lta) sas Ch. 
Mie sce ep Rij ae My 

(b) For every formula @, 

oe are Wty)... B(t,) 

Oe. pa ae 

Keter, if Nias ks me 

PRoorF. By induction on terms and formulas in accordance with the defini- 

tions 8.1 and 8.2. We treat some typical cases. 


f=x:lIf x #Xo,...,%,, then, by 8.1(a) 


and therefore 


3(s rd LAG)= 3 Sto) -- 3) 
x x 


(idle Ne Neensske 


If x = x,, then 


x Lo . L = ie 
Micaiets Ne 
and hence 
3s forces = | = Xt) = = Sto) SS) ej< 5 Sto) saz CES) (x) 
Sa en XgceaX, po ees. 


gp = Rto...t,-4: 


BS] Rigas Ea] 


f,...f fy ...f 
Ovrt riff S(R) holds for 3(" 2 ae 
Ngee x Monee 


r r 


(by 8.2(b)) 


iff 3(R) holds for 3 0) -+ 30) 5) 
ee 


(by (a)) 


r 


5 
iff 3 Seo). St) (R) holds for 


nore 
3 Slo) + SH) 
a, ¢ 


NG 


iff z (tg) one ~(t,) 


Xgui. X 


(fakes 


r 


E Rig. .<b,-4: 


r 
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g = xy: As in 8.2(e), let x;,,...,%;,_, be exactly those variables x; for 
which x, € free(4xw) and x; # t;. Then, for u chosen as in 8.2(e), 


en 
3 = [ay] — 
L WA ee 
ae ee 
iff Se Buf y Get sot 
Rik cise Nps 
a f,....8f; U 
iff there is an ae A such that 3-— wy —® a4 
U Xig res Xi, 


3=(t;,)... = (,.,)3= Ww) 
. . a { u U 
iff there is an aéA such that E ‘| $F 


(by induction hypothesis) 
F 


iff there is ana eA Such that 


4 S(t;,)... S(t, Ja ag 


Mig coe Xp 


(by the coincidence lemma, since u does not occur in f;,...., fi,_,) 


Kt; M(t. 
iff there is anaeéA such that 3 (ti)... Sti, Ja 


Ew 
ig t '* Xj,-1% 


(by the coincidence lemma, since either u = x or u does not occur 


in yw) 
S(t; Kt; a 
iff there is an a € A such that E (lig) “ | ‘ Ew 
(note that x # X;,,...,X # X;,_,, because x;,,..., X;,_, € free(Axw)) 
B(t;,)... S(t; 
iff a iy) be sd o) ( =e Ixy 
Rice Re 2, 
ia aera t 
2 gto) + WO) ay 
eee 
(since for i # ig,..., i, 4,X; € free(dxw) or x; = t,). i] 


In the following lemma we collect several "syntactic" properties of 
substitution. 


8.4. Lemma. (a) For every permutation z d the numbers 0,..., 7, 


tg sesty tea ++ trie) 
Xo aseiX, Xi gy see Raa 
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(b) FO <i<rand x; = ¢;, then 


Xx 
In particular, @ oe 


(c) For every variable y, 
ers 


(i) Ifye me ed then (y € var(fg) u... u var(t,)) or (y € var(t) 
and Vy # X9,...,) # X,); 

(ii) ifye reo aa] then (y € var(tg) u... u var(t,)) or (y a free(~) 
and ¥y # Xo,---, ¥ # X;,). 


Proor. By induction, using 8.1 and 8.2. We give two typical cases of (c). 


fy ...f 
ye vari x 
Nici oe ee 


In case X # Xg,..., X # X, we have 


t = x: Suppose 


prea a 
x : 
Xpkiek, 
hence y = x and so (y € var(x) and y # Xo,..., y # X,). In case X = X; we 
have 
Lawscclk 
x, = Ty, 
re 


hence by assumption y a var(t,;), that is, y € var(fj) u... U var(t,). 


p = Ixy: Let s,ig,..., i, and u be as in definition 8.2(e). Suppose 
Poecet a Pe 
f r us = lo ts-1 . 
ye ree( [3301 es 4 ree ily a ae ue “]) 


lanes Uys 
fi : ; 
y#u and ya ree(v } 


ip tt ¢ Xig-y 


Then 


thus, by induction hypothesis, y # u and (y a var(t;,) u... u var(é;,_,) u {u} 
ory € free(w), y # X;,,.--,y F Xte-1 Y FX). Since for i ¥ ig,...,i,.4 we 
have x, ¢freeQy) or x; = ¢t;, it follows that ya var(tg) u... U var(t,) or 
ye free(w), VA Xq,...,V FX. 
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8.5 Corollary. Suppose free(@) c {x9,...,X,}, where we continue to assume 
that X9,.--,X, are distinct. Then, for terms ty,...,t, such that var(t;) c 
{09 ++. 0,1} the formula 


ee 
ve Vacanieke 
is in L,. In particular, 
Chee 
/p 
ee 
is a sentence. Ee] 


We call the number of connectives and quantifiers occurring in a formula 
~ the rank of ~, written rk(@). More precisely: 


8.6 Definition. 
rk(g):=0, if gis atomic, 
tk(7@) = rk(p) + 1, 


rk(y v W) = rk(p) + rkQy) + 1, 
rk(Sxq@) := rk(g) + 1. 


From the definition of substitution one obtains immediately: 


8.7 Lemma. 
tp... ¢ 
k{ op 4 } = rk(g). 
r(o =) rk(@) 


The quantifier "there exists exactly one" can be conveniently formulated 
with the use of substitution. Let @ be a formula, x a variable, and y the first 
variable which is different from x and does not occur free in g. Then we 


write 1~'xq@ ("there is exactly one x such that »”) for Ix(g a Vy(e = = 


X = y)). It can easily be shown that for every interpretation 3 = (QI, ), 


3 3° 'xq_ iff there is exactly one a € A such that 3 _— Q. 
x 


8.8 Exercise. For n > 1 give a similar definition of the quantifiers "there 
exist exactly n” and "there exist at most n". 
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8.9 Exercise. Let P and f be binary and set x = 09, y = 04, U = 02,0 = Us, 
and w = v4. Show, using definition 8.2 that 


dx dy(Pxu a Pyv) oe 


dx dy(Pxu a Pyu), 
Xyv 


v fuv 
(b) dx Jy(Pxu a Pyv) ~~ = 3x Ay(Pxv a Pyfur), 


‘uv 
dx dy(Pxu a Pyt) nd = dw dy(Pwx a Pyfuv), 


(d) [Vx 3y(Pxy a Pxu) v Jufuu = x] ee — Vu Jw(Pow A Pofxy) 
V dufuu = x. 


8.10 Exercise. Show that if Xo,..., x, € var(tg} u... u var(t,) then 


sete 
k= (pp ——— +9 Vg... X, (Xo EH tg Av AX, = t, > @). 
ee 


8.11 Exercise. Give a calculus in which the derivable strings are exactly those 
of the form 


t-t——— or PXo...X, to..-t, @ 


(Hint: For (a) and (c) in 8.1 one can choose the following rules: 


x Xe PX Ging ee Xa 


‘ 1 ee 


if f €S andf is n-ary. 


CHAPTER IV 
A Sequent Calculus 


In Chapter I we discussed the way in which a mathematician proceeds to 
develop a particular mathematical theory: In order to obtain an overview 
of the theory, he tries to find out what propositions follow from its axioms. 
To show that a proposition follows from the axioms he supplies a proof. 
Now that we have an exact definition of the notion of consequence, we are 
sufficiently equipped to give a more thorough discussion of the goals and 
methods in mathematics. If S is a symbol set and ® is a set of S-sentences, 
we let ®* be the set of S-sentences which are consequences of ®. A mathe- 
matical proof of an S-sentence ¢~ from the axioms in ® shows that @ belongs 
to ®*. For example, consider the set ®,, of axioms for groups, where S = S,,. 
The proof of I.1.1 then shows that the S,,-sentence Vx dy y ° x = e belongs 
to ®;,. However, in view of the goals of the mathematician and the scope 
of his methods, a central question is whether every sentence in ®* can be 
proved from the axioms in ®, In order to answer this we must analyse the 
notion of proof. But even if we limit ourselves to statements which can be 
formulated in first-order logic, we encounter difficulties at the very outset 
of such an attempt. The difficulties arise from the fact that mathematicians 
do not have an exact, fixed notion of proof. A mathematician does not learn 
what a proof is from a list of permissible inferences; rather he gets acquainted 
with this notion by doing concrete proofs in the course of his mathematical 
education. Furthermore, the collection of commonly accepted methods of 
proof is continually being expanded by the addition of new variants. Last, 
but not least, the development of new theories often includes the invention 
of new proof techniques. 

In view of this situation we shall not attempt to give an exact description 
of the whole spectrum of mathematical arguments. Rather we shall look at 
some concrete proofs and try to abstract fromthem certain basic constituents. 
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From these constituents we shall build up a precise notion of proof. It will 
turn out that they are sufficient to reconstruct all types of mathematical 
arguments. Thus we proceed as we did when we introduced the precise 
notion of mathematical statement, where instead of trying to give an exact 
description we used the first-order languages to give a clearly defined frame- 
work. In the case of first-order languages we shall merely be able to 
make it plausible that, in spite of their limited expressive power, these 
languages are in principle sufficient for the purpose of mathematics (cf. 
VII.2). By contrast, we can really prove that every sentence in ®” is provable 
from sentences in ® in the precise sense. 

How can we single out basic constituents of mathematical deductions? 
If we analyse the proofs in Chapter I, for example, we see that those steps 
which are directly related to the meaning of the connectives, the quantifiers, 
and the equality symbol seem very elementary. We mention three examples. 
In a proof one can proceed from statements @ and w, which have already 
been obtained, to the conjunction (@ a $); similarly one can proceed from 
Ft to 4x Px and from Px and x = tf to Pt. We can represent these rules 
schematically as follows: 


~, wW Pt Px,x =t 
(pA Wy) dx Px' zt 


Written in this way, these constituents of proofs can be regarded as syntactic 
operations on strings of symbols. Adhering consistently to this point of view, 
we shall set up a list of deduction rules (in Sections 2 and 4) in this way 
obtaining a calculus 6. We shall motivate its form in §1. In §6 (with a preview 
in $1) we shall give the fundamental definition for the notion of a formula ¢ 
being formally provable from a set ® of formulas. This definition will be based 
on the notion of derivability in 6. Formal provability is the syntactic counter- 
part of the semantic notion of consequence. 
Throughout this chapter we fix a symbol set S. 


(*) 


$1 Sequent Rules 


A mathematical proof proceeds from one statement to the next until it 
finally arrives at the assertion of the theorem in question. The individual 
statements depend on certain hypotheses. These can either be hypotheses of 
the theorem or additional hypotheses temporarily assumed in the course of 
the proof. For example, if one wants to prove an intermediate claim ~ by 
contradiction one adds “¢ to the hypotheses; if a contradiction results 
then g has been proved, and the additional assumption “@ is dropped. 
This observation leads us to describe a stage in a proof by listing the 
corresponding assumptions and the respective claim. If we call a nonempty 
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list (sequence) of formulas a sequent, then we can use sequents to describe 
stages in a proof. For instance, the stage with assumptions @y...@,., and 
claim ¢ is rendered by the sequent (9... @,—-,;,@. The sequence fo... Pp 
is called the antecedent and @ the succedent of the sequent fo... @n- 19. 
From II.4.3 it follows that the formulas which constitute a sequent are 
uniquely determined. In particular, the antecedent and the succedent are 
well-defined. 

In terms of sequents, the indirect proof sketched above can be represented 
schematically as follows: 


Po-++Pna-1 TP W 


(+) Po-++Pn-1 TE TW 
Po--+Pn-1 /p 


Thus (+) describes the following argument: If under the assumptions 
Por..., @,—1 and (the additional assumption) @ one can obtain both the 
formula y& and its negation “wy (that is, a contradiction), then from the 
assumptions g,..., P,—-, One can infer ¢. 

In the following we shall use the letters I, A,... to denote (possibly 
empty) sequences of formulas. Then we can write sequents in the form 
Tow, A$, ... and the scheme (+) as 


(+ +) 


(As in (+), we use spaces between elements in a sequent merely for easier 
reading.) 

According to the concepts which we have developed so far, each step in 
a proof leads from certain stages already attained to a new one and hence 
from sequents to a new sequent. Thus it seems natural to represent deduction 
rules such as (++) as rules of a calculus ©, which operates on sequents 
(sequent calculus). Our conception of & is based upon [16]. For comparison 
the reader can find calculi of a different nature in [25]. 

Before listing the rules of & in the next section, some further remarks will 
be helpful. 

If, in the calculus 6, there is a derivation of the sequent Ig, then we write 
t- Ig and say that Ig is derivable. 


1.1 Definition. A formula ¢ is formally provable or derivable from a set ® of 
formulas (written: ®+ ) if and only if there are finitely many formulas 
@o,-++5M,—, 1n M such that F Po... Y,_1Q. 


Asequent I is called correct if ! = ~, more precisely, if {yy |y is a member 
of T'} & @. Since the rules of S are modelled after usual mathematical 
inferences, it will turn out that they are correct, i.e., when applied to correct 
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sequents they yield a correct sequent. As a result, every formula which is 
derivable from ® also follows from O. We convince ourselves of the correct- 
ness of each rule as soon as we introduce it. 


§2. Structural Rules and Connective Rules 


We divide the rules of the sequent calculus © into the following categories: 
structural rules (2.1, 2.2), connective rules (2.3, 2.4, 2.5, 2.6), quantifier rules 
(4.1, 4.2), and equality rules (4.3, 4.4), We start with the two structural rules. 


2.1 Antecedent Rule (Ant). 
T 
r if every member of I is alsoa member of I" (briefly: if T < 1"). 
Note that a formula which occurs more than once in I- need only occur 
once in I”, 


2.2 Assumption Rule (Ass). 

r @ if pis a member of Y. 
CORRECTNESS. (Ant): If a sequent Ig is correct and I cI”, then since 
I’ & ¢, also I’ & @. (Ass) is correct since ® — g always holds for pe O. O 


(Ass) reflects the trivial fact that one can conclude ¢ from a set of assump- 
tions which includes @. (Ant) expresses the fact that one can re-order or add 
to assumptions. 

The first negation rule incorporates the commonly used method of proof 
by cases. In order to conclude g from I- one first considers the case where a 
condition yw holds and then treats the case where “yw holds. That is, one 
first has y and then “yw as an additional assumption. We can translate this 
argument into a rule for sequents as follows: 


2.3 Proof by Cases Rule (PC). 


-L vo 
IT any @ 
T po 


CORRECTNESS. Suppose PW — g and L mW Fe @ hold. We must show that 
I~ @. Let 3 be any interpretation such that 3—E IT, 1e., 3 x for every 
member ¥ of I. Either 3 Ww or 3 Tw. If 83 w then since DWE @ it 
follows that 3 @. If 3 “yw one obtains the same result because 
law Ee @. L 


As the second rule concerning negation we take the schema (+ +) given 
in &1- 


RIT 
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2.4 Contradiction Rule (Ctr). 


Tr 79 w 
Il 79 Ty 
T Q 


CORRECTNESS. Let TF 4@E wy andl 4@ xy. Then there is no interpre- 
tation satisfying I 1@; hence any interpretation satisfying I’ must satisfy ¢, 
1.¢., 1’ @ is correct. 


2.5 v-Rule for the Antecedent ( v A) 


l @ x 
rT y x 
Tievyp)yx 


The proof that this rule is correct is similar to that for (PC). 


2.6 v-Rules for the Succedent (v S) 


(a) I g _ ©) Te 

T(gevwy) r@vg) 
CORRECTNESS. Suppose F — ¢ and let 3 T, Then 3— @ and hence both 
3 (g v W) and 3E (Y v ¢). a 


2.7 Exercise. Decide whether the following rules are correct: 


@MEa Wi (b) TI gy Wy 
T Wo ; Y @2 W2 
T (#1 Vv 2) V Wo) I (gi Vv Gai A Wo) 


| §3. Derivable Connective Rules 


Using the rules of & which we have formulated so far, we derive a number 
of sequents and introduce the notion of a derivable rule. In our first example 
we show that all sequents of the form (g@ v “@) are derivable. Our notation 
is similar to that used for derivations in previous calculi (cf. Chapter II, §3). 


lg @ (Ass) 
2.0 (~v 1) (VS) applied to | 
(+) 3, 19 19 (Ass) 


4. r@ (e Vv a@) (v5) applied to 3 
5. (@ V 1@) (PC) applied to 2 and 4. 
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We consider the rule (TND)("Tertium non datur" or ''Law of the excluded 
middle”) 


(pv 49) 


which is not a rule of 6. If we add (TND) to S we do not enlarge the set of 
derivable sequents. For if we are given a derivation of a sequent which uses 
rules of S together with (TND), we can insert lines 1-4 of (*) directly before 
every sequent (y~ v —@), which originally was introduced by (TND). In 
this way we obtain a derivation in ©. 

Rules for sequents, whose use in a derivation can be eliminated by a 
derivation schema like («), and which therefore do not enlarge the set of 
derivable formulas, will be called derivable rules. Thus (TND) is a derivable 
rule. The use of such derivable rules contributes to the transparency of 
derivations in the sequent calculus. In the remainder of this section we give 
some useful examples, also including derivable rules with premises. 


3.1 Second Contradiction Rule (Ctr'). 
ry 


JUSTIFICATION. (The justification shows that the rule is derivable. In this 


case we have to show how one can use rules of © to obtain the sequent I’ 
from (the “premises”) Cy and Fr 7§.) 


L7 17 premise 


l yy premise 


2: 

3. F a@ w (Ant) applied to | 
4.1 a4@ aw (Ant) applied to 2 
5: 


l — (Ctr) applied to 3 and 4. 


3.2 Chain Rule (Ch). 


l @ 
low 
ry 


JUSTIFICATION. 

1 Pp premise 

2T@o w premise 

3.1 7@g @ (Ant) applied to 1 
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4.1 4@ 7 7¢@ (Ass) 
5. TF a9 (Ctr') applied to 3 and 4 


6. T Ww (PC) 


applied to 2 and 5. 


3.3 Contraposition Rules (Cp). 


CT ge wy. 


(a) Pa se. 


JUSTIFICATION OF (a) 


(b) 


rc py @ 


lI @ Wy premise 
2T 7y @ wW (Ant) applied to 1 
3.0 aw @ naw (Ass) 
4.0 74” @ —“@ (Ctr') applied to 2 and 3 
5. TF aw 49 a@_ (Ass) 
6.T ay 1g (PC) applied to 4 and 5. 
3.4 
T (9 v W) 
Tl 7@ 
ry 
JUSTIFICATION 
1.7 (g v w) premise 
2 mak) premise 
3. W (Ass) 
4. @ 79 (Ant) applied to 2 
5. Tl @ p (Ass) 
6. T @ Wy (Ctr') applied to 5 and 4 
7.T (ovww (v A) applied to 6 and 3 
8. 1 W (Ch) applied to | and 7. 


3.5 “Modus ponens”’, 


T (p>) lr (navy 
a that is, = 
ry ry 


ro ne ny. Cee 
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The following justification of 3.5 is analogous to the one given for 3.4. 


JUSTIFICATION 

17 (4@ v w) premise 

Qed p premise 

3. T 19 Q (Ant) applied to 2 

4.1 7@ @ (Ass) 

XL =1e W (Ctr') applied to 3 and 4 
6. Tw yy (Ass) 

7.T (n@v wh) w ( v A) applied to 5 and 6 
8. Wy (Ch) applied to | and 7. 


Using the preceding rules we obtain: 


3.6 Lemma. The following sequents are derivable 


(al) g (ev Ww); (a2) yw (p v W); 
(b) (9vwy)rgy; (€) (NMevphey. 


PROOF. For (al): 

l@e@ (Ass) 

2. pe (p v Ww) (vS) applied to 1. 

(a2), (b), and (c) can be proved similarly by using (Vv S), 3.4, or 3.5, respec- 
tively. LI 


3.7 Exercise. Show that the following rules are derivable. 


ro rT 77@ 
(al) Prone i ara 
(b) To Yr 
T yw rp 
fen: os Fae va)’ 
T (paw) l (paw). 
(cl) re —; (c2) ry ” ; 
(d) Tow 


$4. Quantifier and Equality Rules 


§4. Quantifier and Equality Rules 


Now we give two sequent rules of © which involve the existential quantifier. 
The first is a generalization of a scheme already mentioned in the introduction 
to this chapter. 

4.1 Rule for 3-Introduction in the Succedent (3S). 


t 
T x 
°x 


T Axg 


(SS) says, in essence, that we can conclude Jx@ from I if we have already 
obtained the "witness" ¢ for this existence claim. 


t : ‘ 
CORRECTNESS. Suppose I — ae Let 3 be an interpretation such that 


3" TI. By assumption 3 — o- holds. Then by the substitution lemma, 


s3(t 
3 “O k= ~ and hence 3 — 4x¢ also. CL 


The second 3-rule is more complicated, but it incorporates a method of 
argument that is used frequently. The aim is to prove aclaim w from assump- 
tions @g,---; Py—1, 4XxY. (On our formal level to achieve the corresponding 
aim requires a derivation of the sequent 


(*) Po ++. Pn-19X@ 
in the sequent calculus.) 

According to the hypothesis 4xg, one assumes one has an example— 
denoted by a new variable y—which "satisfies ~” and uses it to prove §. 
(In the sequent calculus this corresponds to a derivation of 


y 
(+*) Po--. Pn-1P — Ws 
where y is not free in (*).) Then one regards y& as having been proved from 
Por+++5 Qn—1» IX’. We can reproduce this argument in the sequent calculus 
by a rule which allows us to proceed from (**) to (x): 
4.2 Rule for 3-Introduction in the Antecedent (3A). 
y 
I 9 - Wy 
race if y is not freein TD 4x wp. 


' Cf. the proof of [,1.1. with the use of y in line (1). 
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CORRECTNESS. Suppose Perr Ww, y is not free in IT Sxmw, and the 


interpretation 3 = (YI, £) is a model of I dx. We must show that 3 yw. 


' a ; ae 
First, there is an a€éA such that 3- = @. Using the coincidence lemma 
x 


a 
we can conclude 3*\5 F g. (For x = y this is clear; for x # y note 
that y ¢free(~) since otherwise yefree(3x@m) contrary to assumption.) 


Because 3 : (y) = a we have 


y 
and hence by the substitution lemma 3 E go -. From 3 & [and y ¢ free(T) 
x 


ake ; A 
we get 3 : - I, again by the coincidence lemma; since Ig —  w we obtain 
X 


3° E= w and therefore 3 — w because y ¢ free($). = 
y 


The condition on y in (JA) is essential. For example, the sequent 
[x= fy]=y = fy is correct; however, the sequent ixx=fy y= fy, 


which we could obtain by applying (4A) while ignoring this extra condition, 
is no longer correct. This can be verified, say, by an interpretation with 
domain N, which interprets fas the successor function nt n + | and y asQ. 


| rare F 3 ; 
From a formula @— it is not in general possible to recover either @ 
x 


or ¢. For instance, the formula Rfy can be written as Rx or as Rfx2 ; 
x x 


Therefore, in applications of the rules (3S) and (3A), we shall explicitly 
mention g and ¢ or ¢ and y if they are not clear from the notation. 

The last two rules of © arise from two basic properties of the equality 
relation. 


4.3 Reflexivity Rule for Equality (=). 


~~ 
Il 
~~ 
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4.4 Substitution Rule for Equality (Sub). 


Yr Qo 
x 
t’ 
lr=t' @- 
x 


t 
CORRECTNESS. (= ): trivial. (Sub): Suppose T = @ - and suppose 3 satisfies 


t 
T and t=r. Then 3 es and hence, by the substitution lemma, 


x} t 
30. g; therefore since 3(t) = 3(t’) we have 3 ©) 


Eg. A further 
x 


f 


t 
application of the substitution lemma yields finally that 3 — ~ z LJ 


4.5 Exercise. Decide whether the following rules are correct: 


Sg: ee P ixp axy’ 
ro Lv 
(c) ry = , if fis unary, and f and y do not occur in I ¥x@. 
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Since o- = ~, we obtain from 4.1 and 4.2 (for t = x and y = x) 


§.1. 


Faxe W’ if x is not free in T y. 


A corresponding special case of (Sub) is 


5.2. 
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We conclude with some derivable sequents dealing with the symmetry 
and the transitivity of the equality relation and its compatibility with func- 


tions and relations. 


5.3. 


5.4. 


Kip =t, ty 


5.5. For every RES, R n-ary, 


t Rtg..-th—4 to = to fi, =f 


5.6. For every f € S, f n-ary, 
Kh =tp ty =U 
JUSTIFICATION OF 5.3 THROUGH 5.6 


5.3: Let x be a variable not occurring 


(=) 


1. fo = lo 


iS) 
~ 
Qo 
Ul 
~ 
~ 
a 
Ul 


ty—1 = ta-1 


thay = ty-1 


in fo or fy. 


5.4: Suppose x is a variable not occurring in tg, t;, or f2. 


l. tg =f, to =t, (Ass) 


Hl 


2.t9=t, t, =t, 


Rigs 


ta 
n-1- 


Ae csulgeg = flo sll ad 


f 
= (Sub) applied to 1 using to = tg = [x= to] - 


t 
t, (Sub) applied to 1 using tp =t, = [to = x] = 


5.5: For simplicity we assume that n = 2. Let x be a variable which does not 


; 
occur In tg, t,, to, OF ¢4. 


1, Rtoty Rtoty 
2. Rtgpt; to = to Rtot, 
3. Rtot; to=to ty =t, Root 


5.6 can be treated similarly. 


(Ass) 
(Sub) applied to | using 


t 
Rtot, = [Rxt, ] _ 


(Sub) applied to 2. using 


f 
Rtot, = LRtox] = 
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5.7 Exercise. Show that the following rules are derivable: 


ane: rev 
(al) 2 = that is, ———*; (a2) L ae 
re 
LOS es 
£ y 
lo- wy 9 > 
(b1) Pye (b2) T vxo’ # y is not free in l yx@; 
3)? % aay i. if x is not free in I 
Vr Vx $> Vx 
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For the reader's convenience we list all the rules of & together: 


r 
(Ass) FG ifg isin (Ant) re flo. 
r ye rng wy 
(PC) T 7w @ (Ctr) T ne aw 
r Q IT ~ 
Tr r 
(VAT go x (vs) a Q 
So ae. yy rigvy) TW v@) 
Pievwx 
y t 
ox ee 
(4A) Er oxo’ if y is not free in Ix Us (IS) Tr Ix 
T 9 ~ 
a mer (Sub) i 
Tt=t ae 


In 1.1 we defined a formula ¢ to be derivable (formally provable) from ® 
(written: ®t @) if there are formulas @,...,9,.; in ® such that 
I Qo... @,~1@. From this definition we immediately obtain: 


6.1 Lemma. For all ® and g, ®}+ @ ifand only #f there is a finite subset Dy 
of P such that By F ¢. CL 


70 IV. A Sequent Calculus 


We have already more or less proved the correctness of S: 


6.2 Theorem on the Correctness of S. For all ® and g, # ® I- g, then Dt g. 


Proor. Suppose ® L- @. Then for a suitable T from ® (that is, a I whose 
members are formulas from ®) we have I-IT g. As we showed, every rule 
without premises yields only correct sequents, and the other rules of © 
always lead from correct sequents to correct sequents. Thus, by induction 
over &, we see that every derivable sequent is correct, hence also I g. There- 
fore  — mand so @E @. LI 


We shall prove the converse of 6.2, namely, "if ® = g then ® E¢—”, in 
the next chapter. In particular, it will follow that if @ is mathematically 
provable from ®, and hence ® & 4g, then ¢ is alsoformally provable from ®. 
However, because of the elementary character of the rules for sequents, a 
formal proof is in general considerably more complicated than the corre- 
sponding mathematical proof. As an example we give here a formal proof 
of the theorem Vx Jy yo x = e (existence of a left inverse) from the group 
axioms My = VX Vy ¥Z(Xoy)ozZ = X°o(yoZ), py = Vx xc°e =X, and @2 = 
Y¥x dy xoy =e. For simplicity we shall write xy instead of x o y. The reader 
should compare the formal proof below with the mathematical proof of the 
same theorem in I.1.1. The chain of equations given there corresponds to the 
underlined formulas in the derivation. For easier reading we use the de- 
rivable rules: 


Lt, =t Lt,=t 

Svm 0 1 0 1 
(Sy ae =i and (Trans) ee: 
Tl to =t 


The reader can easily justify them by means of 5.3, 5.4, and (Ch). 


1. Mo Yr G2 Vx xe =X (Ass) 

2. Po Pi 2 (yx)e = yx 5.Hal) 
applied to 1 
setting t = yx 


3. Po Pi P2 yx = (yx)e (Sym) 
applied to 2 

4. Po Pi P2 @ = yz yx = (yx)(yz) (Sub) 
applied to 3 

5. Po Pr Pr YZ =e e= yz 5.3 and (Ant) 

6. Po Pi Pr YZ =e yx = (yx)(yz) (Ant) and 
(Ch) applied 
to 5 and 4 

7. Po Pi P2 V2 =e Vx Vy Vz (xy)z = x(yz) (Ass) 

8. Yo YO, Pr yZ =e Vu Vz (yu)z = y(uz) 5.7 (al) 


applied to 7 
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9. Po Py Y2 


10. 


11. 


12. 


13. 


pal 


22. 


23. 


24. 


25. 


Po 


Po 


Po 


Po 


- Po 


Po 


Po 


Po 


Po 


Po 


Yi 


Py 


Py 


Py 


Py 


P1 


P2 


P2 


P2 


P2 


P2 


P2 


Po 


Pr 


Pr 


P2 


P2 


Po 


P2 


P2 


Pr 


P2 


yZ=e 
yz=e 
yz2=e 
yz =e 
yzZ=e 
yz =e 
yz =e 
yZ=ec 
yZ=ece 
yZ =e 
yz=e 
yZ=e 
yZ=e 
yZ =e 
xXxy=e 
xy=e 
xy=e 


x(yz) = (xy) 
xXy=e 

xy =e 
xy=e 
xy=e 

xy =eye= 
xy =e 
Xy=ze 
yeSe 

yZ =e 
dzyz=e 


"> 


a 


Vz (yx)z = (xz) 


(yx)(yz) = (x(yz)) 


yx = yW(x(yz)) 


yx = y((xy)z) 


(xy)z = x(yz) 


x(yz) = (xy)z 
yx = y((xy)z) 
xXx = eZ 


(ye)z = y(ez) 


y(ez) = (ye)z 


yx = (ye)z 
yx = yz 
ye =y 
yx = yz 
yx =e 
dy yx =e 
dy yx =e 


7 


5.7(a1) 
applied to 8 
setting t = x 
5.7(a1) 
applied to 9 
settingt = yz 
(Trans) 
applied to 6 
and 10 
(Sub) 
applied to 11 
5.7(a2) 
applied 
three times 
to 7 

(Sym) 
applied to 13 
(Ch) applied 
to 14 and 12 
(Sub) 
applied to 15 
5.7(al) 
applied 
three times 
to @q; like 
steps 7-10 
(Sym) 
applied to 17 
(Trans) 
applied to 16 
and 18 
(Sub) 
applied to 19 
5.7(al) 
applied to I 
setting t = y, 
and (Ant) 
(Ch) applied 
to 21 and 20 
(Sub) and 
(Ant) applied 
to 22 

(48) applied 
to 23 

(JA) applied 
to 24 
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26. 


27. 
28. 


29. 


30. 


31. 


32. 


33: 


34. 


Yo PY, G2 xy =e Vy Az yz =e 


xy =e 
xy =e 


jyxy =e 
Vx dyxy =e 


P2 


Po YP: P2 XY He 


Yo Pi G2 Vx Ayxy =e 


Po P1 P2 


dy yx =e 
xy=e 
dzxz=e 
dzxz=e 
dzxz=e 
Vy dzyz =e 
dy yx =e 
dy yx =e 
Vx dy yx =e 
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5.7(b3) 
applied to 25 
(Ass) 

(3S) applied 
to 27 

(3A) applied 
to 28 

5.7(b3) 
applied to 29 
5.7(b2) 
applied to 30 
(Ant), (Ch) 
applied to 31 
and 26 

(4A) and 
5.7(b3) 
applied to 32 
(Ant) and 
5.7(b4) 
applied to 33 


6.3 Exercise. Following the proof of I.2.i, give a derivation for the sequent 


Wo Wi W2 Vx Vy (du(Rxu a Ryu) > Vz (Rxz © Ryz)), 


where Wo, /,;, and yw, are the axioms for equivalence relations (cf. III.6.1). 


§7. Consistency 


The syntactic concept + of derivability corresponds to the semantic concept 
E= of consequence. As a syntactic counterpart to satisfiability we define the 


concept of consistency. 


7.1 Definition. (a) ® is consistent (written: Con ®) if and only if there is no 
formula g such that B+ mand D+ 7¢@. 
(b) ® is inconsistent (written: Inc ®) if and only if ® is not consistent (that 


is, if there is a formula g such that Wt mand M+ “@). 


First we show that from an inconsistent set one can derive any formula. 


7.2 Lemma. For a set d formulas ® thefollowing are equivalent: 
(a) Inc ®, 


(b) For ally, DE g. 
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PROOF. (a) follows immediately from (b). Suppose, on the other hand, that 
Inc ® holds, ie, @ EH Wand DE Twp for some formula yw. Let @ be an arbi- 
trary formula. We show ®t 4. First of all there exist I-, and I, consisting 
of formulas from ® and derivations 


and 
r, yp Pr, ay 


By combining these two derivations and adding to the resulting derivation, 
we obtain 


mM. Tr, w 


n. Tr, 7p 
n+i.7T,F, yw (Ant) applied to m 
n+2.1,0, aw (Ant) applied to n 
n+3.1,F,@ (Ctr) applied ton + 1 and nt 2. 
Thus we see that PF 9g. O 


7.3 Corollary. For a set of formulas ® the following are equivalent: 
(a) Con ® 
(b) There is a formula @ which is not derivable from ®, LO 


Since ® + g if and only if By t+ @ for a suitable finite subset Dg of P, we 
obtain: 


7.4 Lemma. For all ®, Con ® if and only if Con ®p for all finite subsets Do 
of ©, L 
7.5 Lemma. Every satisfiable set of formulas is consistent. 


PRooF. Suppose Inc ®, Then for a suitable g both + @ and O+ 19g; 
hence, by the theorem on the correctness of S, ® E gm and MOE “Tq. But 
then ® cannot be satisfiable. LJ 


Later we shall need 


7.6 Lemma. For all ® and 9, 


(a) ff not Dt g, thenCon®u gpl = 
(b) f Con Pand D+ @g, then Con Du {oe}; 
(c) if Con ®, then Con ® u {¢} or Con ®u {7g}. 
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PRooF. (a) Suppose not P+ g, but ® u {1g} is inconsistent. Then for a 
suitable I consisting of formulas from ®, there is a derivation of the sequent 
I 7@ g. From this we obtain the following derivation: 


I @ 9 
Ig @ (Ass) 
Tr g (PC) 


Hence ® [- g, a contradiction. 

(b) Interchange the réles of g and “1g in (a) and note that ®t “@ does 
not hold. 

(c) This follows directly from (a) and (b). L 


In this chapter we have referred to a fixed symbol set S. Thus when we 
spoke of formulas we understood them to be S-formulas, and when dis- 
cussing the sequent calculus S we actually referred to the particular calculus 
Ss corresponding to the symbol set S. In some cases it is necessary to treat 
several symbol sets simultaneously. Then we insert indices for the sake of 
clarity. To be specific, we use the more precise notation ® t-, @ to indicate 
that there is a derivation in & , (consisting of S-formulas) whose last sequent 
is of the form I g, where I consists of formulas from ®, Similarly, we write 
Con, © if there is no S-formula ¢ such that ® t-, gand Ot-, 4@.? 

In the next chapter we shall need: 


7.7 Lemma. For ne N, let S, be symbol sets such that Sp < S$} $8, C..., 
and let ®,, be sets of S,- formulas such that Cons, @,and Mp < ®, © ®, c.... 
Let S = (Jnrexy S, and ® = | J,c4 ®,. Then Con, ®. 


PRoor. Assume the hypotheses of the theorem, and suppose Inc, ®. Then, 
by 7.4, Incs ‘¥ must hold for a suitable finite subset ‘¥ of ®. There is a k such 
that ‘Y¥ < ®, and hence Inc, ®,; in particular, ®, I-, v9 = vp and ®, K, Tu 
= Uo. Suppose we are given S-derivations for these two formulas. Since they 
contain only a finite number of symbols, all the formulas occurring there are 
actually contained in some L*=, We may assume that m= k. Then both 
derivations are derivations in the S,,-sequent calculus and therefore Incs_ ®,. 
Since ®, c ®,, we then obtain Inc, » ®,,, which contradicts the hypotheses 
of the theorem. 


2 The reader should note that for two symbol sets S' > S, and for ® c L‘ and geé L*, it is con- 
ceivable that @ + ,. @ but not @ Ks q, for it could be that formulas from L* — L* are used in 
every derivation of @ from P in Ss;., and that (later on in the proof) these formulas are then 
eliminated from the sequents, say by application of the rules (Ctr), (PC), or (JS). In V.4. we shall 
show that this cannot, in fact, happen. 
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78 Exercise. Define (SV) to be the rule 


I jxg Vxp 


(a) Determine whether (4¥) is a derivable rule. 
(b) Let 6' be obtained from the calculus of sequents © by adding the rule 
(SV). Is every sequent derivable in ©’? 


CHAPTER V 
The Completeness Theorem 


The subject of this chapter is a proof of the completeness of the sequent 
calculus, 1e., the statement: 


(*) For all ® and g, if DE g then D+ @¢. 
In order to verify (*) we show 
(**) Every consistent set of formulas is satisfiable. 


From this, (*) can be proved as follows: We assume for ® and g that ® = @, 
but not ®+ g. Then ® vu {4¢} is consistent (cf. Chapter IV, 7.6(a)) but not 
satisfiable, a contradiction to (*#*). 


To establish (**) we have to find a model for any consistent set ® of 
formulas. In §1 we shall see that there is a natural way to do this if ® is 
maximally consistent and if it contains witnesses. Then we reduce the general 
case to this one: In $2 for at most countable symbol sets, and in $3 for arbi- 
trary symbol sets. 

Unless stated otherwise, we refer to a fixed symbol set S. 


1 Henkin’s Theorem 


Let ® be a consistent set of formulas. In order to find a model 3 = (YI, f) of 
®, one can only use the "syntactical" information given by the consistency 
of ®. Hence we shall try to obtain a model using syntactical objects as far 
as possible. A first idea is to take as domain A the set of all S-terms, to define 
B by B(v;) = v;(iEN) and R” say for unary R, by R” = {te A|Rte®}. 
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Then, if for instance Rx € ® and Rx > Ry € ®, we should have Ry € ®, and 
if 3x Rx € ® there should be a "witness" t, i.e. a term t such that Rte ®. 
We see that in order to get a model of ® in this way, ® has to satisfy certain 
closure conditions. These are made precise in the following definition. It 
will turn out that they are sufficient to carry out the above idea. 


1.1 Definition. Let ® be a set of formulas. 


(a) ® is said to be maximally consistent if and only if Con ®, and if every 
formula g with Con ® u {g} already belongs to ®. 
(b) ® contains witnesses if and only if for every formula of the form dx@ 


2 t 
there exists a term ¢ such that (3x0 > 4 E®, 
x 


If 3 is an interpretation, then the set ® = {ge L5]3 = } is maximally 
consistent: Since 3 = ®, ® is satisfiable and hence by IV.7.5 it is consistent. 
Further, if Con ® u {g}, then @¢®; hence 3 mand so gE, 

Conversely, with every maximally consistent set ® which contains wit- 
nesses we shall associate an interpretation 3, as outlined above such that 
So E= ©. (Thus it turns out that every such © is satisfiable.) 


1.2 Lemma. Let ® be maximally consistent and contain witnesses. Then, for 
all g and y: 


(a) If DE g, then ge ®. 

(b) Either g € Dor gE. 

(c) (g v Ww) Ee @ if and only if pe D or PEO. 
(d) If (@> fp) Ee @and gE, thenpe ®. 


, t 
(e) ixpe Of and only if there is a term t such that p—€®. 
x 


PRooF. (a) If D+ g, then by IV.7.6(b), Con ® u {g}, and hence gé¢ ® since 
® is maximally consistent. 

(b) By IV.7.6(c) we have Con ® u {eg} or Con ® u {74}, and therefore 
Pe or ame. Since D u {¢g, 1¢g} is inconsistent, g and 1@ cannot 
both belong to ®. 

(c) Suppose first that (gv W)e®. If @¢@M then r@e®. Since 
-(g v i) 4@ } (cf. IV.3.6(b)), we have ® F yf and from (a), yy € ®. On the 
other hand, if, for example, g € ®, then by IV.3.6(al), PF (g v wW), and so 
by (a), (9 v We®. 

(d) Assume that (g > /) (Le. (4@ V )) and @ belong to ®. Since 
K (79 v Whew (cf. IV.3.6(c)), we obtain by (a) that yf belongs to ®. 

(e) First suppose that 4x € ®, Since ® contains witnesses, there is a term 

\ 


t such that [Sxg—>@ _ €®9, and therefore o E® by (d). On the other 
Xx 


‘ 


hand, if @ e@®, then Dt Ax¢@ (use (35)) and by (a), IxpeEO. El 
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From 1.2 we shall obtain the result that for an interpretation 3, the state- 
ment 


(*) sE@g iff pEe® 
holds for all g (and hence 3 = ®) provided we can establish (1) and (2): 


(1) (*) holds for atomic g. 
(2) For every element of the domain of 3 there is a term t such that S(t) = a. 


Taking up our original idea we construct an interpretation 3 satisfying 
(1) and (2). For the domain of 3 we intended to take the set of terms, and to 
arrange the interpretation so that 3(t) = t (cf. (2)) and (St Rtg... t,-| 
iff Rtg ...t,-,€@®) (cf. (1)). A slight difficulty arises concerning equations: 
If tp = t, € ®, then on account of (*), (tg) = 3(t,) must hold even if ty and 
ft, are distinct terms. We overcome this difficulty by defining an equivalence 
relation on terms and then using the equivalence classes rather than the 
individual terms as elements of the domain of 3. 

In the remainder of this section let ® be a maximally consistent set con- 
taining witnesses. We proceed to define the interpretation 3g =% fg). 
First of all, we introduce a binary relation ~ on the set T* of S-terms: 


1.3. lo in ty iff lo i ty E . 


1.4 Lemma. (a) ~ is an equivalence relation. 
(b) ~ is compatible with the symbols in S in the following sense: If 
to ~ lor...5ta~y ~ ty—y, then for n-ary f eS, 


Slovene flo tex tei 
and for n-ary Re S, 


Rty...t,-,ED iff Rto...t,.,€@. 


The proof uses the rule (=) and IV.5.3-5.6. We give two cases as examples: 


(1) ~ issymmetric: Suppose ty ~ f,, that is,tg = t, € ®. By IV.5.3 we obtain 
Mt t, = to; hence, by 1.2(a), t; = to ED, 1e., ty ~ ty. 

(2) Let f be an n-ary function symbol from S$, and assume ty ~ &,..--; 
tri ~ta-1, Le, to = toe ®,...,t,_5 =t,-,EP. Then by IV.5.6, 
Ob fty...ty-1 = fto--.t,_;, and by 1.2(a), fig-..b-1 ~ fto-.-th-1 OF 


Let f be the equivalence class oft, 
f:= {t'E T*[t ~ f}) 
and let J be the set of equivalence classes, 


Ty = {@|t € TS}. 
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Ty is not empty. Define the S-structure 2, over Tg, the so-called term structure 
corresponding to ®, by the following clauses: 


15. For n-ary R eS, 
Ri°iy...8,-1 iff Rto...t,- EO. 
1.6. For n-ary f €S, 
Plessis) = Lipisetn sts 


1.7. Force S, 


Ce i=C, 


By 1.4 the conditions in 1.5 and 1.6 are independent of the choice of the 
representatives t9,...,t,—1, hence R*® andf *® are well defined. 
Finally, we fix an assignment Bq» by 


1.8. Bolx) =X; 
We call 3g = (Tp, Be) the term interpretation associated with E. 


1.9 Lemma. (a) For all t, 3g(t) = f. 
(b) For every atomicformula g, 


Joep iff pe®. 


PROooF. (a) By induction on terms. The lemma holds for t = x by 1.8 and for 
t=cbyl.7.Ift = ftp... t,-1, then 


Jo(fto... tri) = f**(Solto),..., Soltn-1)) 
= f7*(i5,...,f,-1;) (by induction hypothesis) 


=ftg..-th-4 (by 1.6). 
(b) Sotto =t, iff Bo(to) = So(t,) 
iff fp = f, (by (a)) 
iff lo ae Uy 
iff tp =t, €@. 
So Reet tag. tt ROR gt ig 
iff Rto...t,-1€@® (by 1.5). 1 


1.10 Henkin’s Theorem. Let ® be a maximally consistent set containing 
witnesses. Then for all 9, 


(*) Soe Qo if pE®. 
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PRooFr. We show (*) by induction on the number of connectives and quanti- 
fiers in ~, in other words, by induction on rk(g) (cf. H1.8.6). If rk(@) = 0, then 
@ is atomic, and (*) holds by 1.9(b). The induction step splits into three 
separate cases. 


(1) @=7y: 
Sok Ty iff not Sore wp 
iff p¢® (by induction hypothesis) 
iff ape® (by 1.2(b)). 
(2) p= v w): 


3E( vy iff SE Yyora Fx 
iff ye@MoryeE® (by induction hypothesis) 


iff (Y v ye (by 1.2(c)). 
(3) g = dx: 
So f= dxw 


f 
iff there is a term t such that 3, — = w 
x 


So(t 
iff there is a term t such that 3, ae) E= (by 1.9(a)) 


iff there is a term ¢ such that 3g k= - (by the substitution lemma) 
; ; t 
iff there is a term f such that y = E® 


(by induction hypothesis since r(u :| = rk() < rk(¢) (cf. IIL8.7)) 
iff Ixye® (by 1.2(e)). C 


1.11 Corollary. Let ® be a maximally consistent set containing witnesses. 
Then Sq t= P and therefore ® is satisfiable. [e) 


§2. Satisfiability of Consistent Sets of Formulas 
(the Countable Case) 


By 1.11, every maximally consistent set of formulas containing witnesses is 
satisfiable. We prove that any consistent set ® of formulas is satisfiable by 
showing how to extend it to a maximally consistent set containing witnesses. 
In this section we settle the case of symbol sets which are at most countable. 
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In the following let S be at most countable. First we treat the case where only 
finitely many variables occur free in ®, i.e., where free(@):=|_),-@ free(@) is 
finite. 


2.1 Lemma. Let ® & L* be consistent and let free(@ )bejinite. Then there is a 
consistent set ‘¥ such that ® < ‘¥ c L‘ and ¥ contains witnesses. 


2.2 Lemma. Let ‘Y « L* be consistent. Then there is a maximally consistent 
set Owith F< @e L’*. 


2.1 and 2.2 enable us to extend a consistent set ® of formulas in two stages 
to a maximally consistent set of formulas containing witnesses. First of all, 
we extend ® to according to 2.1, and then Y to © according to 2.2. © is 
maximally consistent, and it contains witnesses because Y¥ does. Hence by 
1.11, @ is satisfiable, and, since DB c ©, Mis also satisfiable. To summarize we 
obtain: 


2.3 Corollary. Let ® be consistent, and let free(@) be finite. Then ® is 
satisfiable. L 


It still remains to prove 2.1 and 2.2. 


PROOFOF LEMMA 2.1. By II.3.3, L* is countable. Let 3x9 @o, 3x;@1,... be a 
list of all formulas in L> which begin with an existential quantifier. Inductively 
we define formulas wo, ;,..., which weadd to ®, For eachn, y, is a "witness 
formula" for 4x, @,.- 

Suppose ,, is already defined for m <n. Since free(@) is finite, only 
finitely many variables occur free in ® u {y,,|m <n) uw {3x,@,}. Let y, bea 
variable distinct from these. We set 


Wri= (3s, Pn > Pn 22) 
xX 


n 


Now let 


Y:=Ou {Wo, Wir. -fe 


Then ® c ¥ and ¥ clearly contains witnesses. It remains to be shown that 
P is consistent. For this purpose put 


®,:=Ou {y,,|m <n). 


Then P79 cD, cM, c... and ¥ = |J,-ny ®,. By IV.7.7 (for S = Sp = 
S, =...) the proof will be complete if we can show that each ®, is consistent. 
We proceed by induction on n. 

Since ®) = ®, Con ®,y holds by hypothesis. For the induction step we 
assume that ®, is consistent. Suppose, for a contradiction, that ®,,, = 
®, u {w,} ‘is inconsistent. Then for every @ there exists [ over ®, such 
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that + Tw, @, Le., 


Thus there is a derivation, 


mM. r(7as0, V Dy a 


which, if @ is a sentence, we can extend as follows: 


m+1. VT 44x,Q, T5x,Qr (Ass) 


m+ 2. T 74x,.@, (>a:,0, V Dn 2) (v S) applied to m + 1 


n 


m+ 3. T 744x,9, (Ch) (with (Ant)) applied to m + 2 
and m 
m+4TI @, Zs Q (analogously) 
Xn 
m+ 5. T 4x,@, © (3A) applied to m + 4(y, does not 
occur free in I 4x,,@,, ¢) 
m+ 6. —T 7) (PC) applied to m + 5 and m + 3 


Hence we have ®, + q. But then @, is inconsistent, as can be seen by taking 
@ = Avo Vp = Vg and ~ = A4Avg Vg = Ug. This contradicts the induction 
hypothesis. LI 


PROOF OF LEMMA 2.2. Suppose ‘PV is consistent and let @o, @;, @2,... be an 
enumeration of L°, We define sets of formulas @, inductively as follows: 


Oo = VY, 

and 
Be et i. U {o,} if Con @, v {¢,} 
n+] 07 : 
n otherwise, 

and we set 

© = |) ©,. 

neN 


First of all, ¥ < @. Clearly all ©, are consistent, and hence by IV.7.7, @ is 
consistent as well. Finally, @ is maximally consistent. For if ge L*, say 
@ = ¢,, and if Con © vu {¢,}, then, since ©, < ©, we obtain Con ©, vu {¢g,} 
and hence ¢,, € ©,41, 1.€., P, € ©. LJ 
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Now we drop the assumption that free(@)is finite. 
2.4 Theorem. F S is at most countable and ® c L* is consistent, then ® is 
satisfiable. 


PRoor. We reduce 2.4 to 2.3. Let cg, c,,... be distinct constants which do 
not belong to S, and set 


SP SA Cas Cis ok 


For ge L* denote by n(@) the smallest n such that free(@) c {t9,...,U,—4}- 
Let 


0... &n(g)—-1 


c 
(1) P= and D'= {|g eG}. 


Vo wee Unig) - 1 
First, by IIL.8.5, 


(2) free(®’) = @. 


Now it will suffice to show that 


(3) Cong, ®’, 


for then we know from the special case proved in 2.3 that ®' is satisfiable, 
say by some interpretation 3 = (QI’, f’). Using the coincidence lemma we 
can, by (2), assume that f’(v,) = c4, ie, 3’(v,) = ¥(c,) for all ne N. Then 
from (1) and the substitution lemma it follows that SV is a model of ®; hence 
® is satisfiable. 

We prove (3) by showing that every finite subset o of ®’ is satisfiable, 
and thus, by IV.7.5, consistent (with respect to S'). Let ®) = {@,..., @r— 135 
where Wo,---; Pa-1 € PD. Since {@p,..-, @,-;} is a subset of ® it is consistent 
(with respect to S), and since only finitely many variables occur free therein, 
it is satisfiable (cf. 2.3). 

Choose an S-interpretation 3 = (W, B) such that 


(*) St= {Qo,---: Pn~1} 


and expand Y to an $’-structure QW’ with cj" = 3(v,) for ne N. From (1), 
(*), and the substitution lemma, it then follows that the S’-interpretation 
(Q’, 8’) which results is a model of ®%. LJ 


The following exercise shows that the assumption "'free(@) is finite" in 
2.1 is necessary. 


2.5 Exercise. Let S be arbitrary and let 
® = {v9 =tl|te TS} U (3v9 Jv, T9 = 0;}. 


Show that Con ® holds and that there is no consistent set in L° which in- 
cludes @ and contains witnesses. 
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2.6 Exercise (A Special Case of the So-called Herbrand Theorem). Let S be 
a symbol set, and let g and w be quantifier-free formulas in which there is at 
most one free variable, namely x. Show that if 


Vx@ & Ixy, 
then there is an n > | and there are S-terms fg, ..., f,~ 1, Sg, -+-5Sn—, Such 
that 
t Eee S Sax 
QA A OE pevev not 
Es x x 


(Hint: Give an indirect proof making use of the set 


Bs» 4; | ae 
\¢ 5 lt is an Sterm| U a —|tis an s-erm|. 
x 


§3. Satisfiability of Consistent Sets of Formulas 
(the General Case) 


In this section we no longer assume that $ is countable. The réle of 2.1 and 2.2 
will be taken over by 3.1 and 3.2. 


3.1 Lemma. Assume ® e L* and Cong ®, Then there is an S’' > S and a ¥ 
such that ® < ‘¥ < L*' and Con,, , and ¥ contains witnesses with respect 
to S' (i.e., for every formula of the form 3x € L* there is a term t € T* such 


‘ 
that (3x0 > o:) ef). 


3.2 Lemma. Assume ‘¥ ¢ L® and Cons '¥. Then there is a set © such that 
Yo @c LU and @ is maximally consistent with respect to S. 


We obtained 2.3 from 2.1 and 2.2; likewise we have from 3.1 and 3.2 the 
following: 


3.3 Corollary. [f ® ¢ L* and Con, ®, then ® is satisfiable. O 


The following consideration will lead to a proof of 3.1. 
Let S be an arbitrary symbol set. Associate with every @ € L* a constant 
c, such that c, €S and c, # c, for g # w. Defining 
S* = S ut {Caxg|IX@ E L‘) 
and 
W(S) := }3v0 +O ie |Sxpe ut. 
one obtains for ® c L: 


§3. Satisfiability of Consistent Sets of Formulas (the General Case) 85 


3.4. If Con, ®, then Con,, ® U W(S). 


Proor. Suppose Cons ® holds. We show that every finite subset ®% of 
® U W(S) is consistent with respect to S* by proving that it is satisfiable. 
Let 


Cc Ch 
OF = O60 |3r0% - 04 Seng Dp 1Ont > Dee . 
Xo Xn-4 


where Dy c ®, 3x9 oq, .- +5 IXn—-1Mn—1 € L', and where ¢; stands for C4x,9, 
For a suitable finite subset Sg < 5S, we have 


Do YU {3x@o, aa AXn-1QPn—1} = 2° 


Further, since Con, ® holds, so does Con, ®y, and hence, ofcourse,Con,, Do. 
Because free(®,) is finite, it follows from 2.3 that Dg is satisfiable. 

Let 3 = (YI, £) be an S-interpretation which satisfies Py and fix an element 
a in A. In order to satisfy ®¥ we extend 3 to an S*-interpretation 3* as 
follows: For i < n we choose 4a; € A such that 


(*) Soe g;, if Si 3x0, 


and a; = a otherwise. We extend YI to an S*-structure W* by setting 
er = a; 


for i < nand interpreting the remaining constants of the formc,, by a. Let 
a* — (QI*, 8). Since no constant c,, occurs in ®o, it follows from 3 k= Do 
that 3* t= ®,. Furthermore 


Cj 
S* = 5x; 9; > Pie. 


(and this shows that @# is satisfiable). In fact, if 3* E= 4x;@; then 3* = EQ; 


by (*). Since a; = 3*(c;) it follows by the substitution lemma that 3* = 
Cj 

P:—. 
x; 

PROOF OF Lemma 3.1. Let ® c L‘ and suppose Con, ®, We define a symbol 

set S' and ‘¥ c L* with the following properties: 

(a)S$cS'andOc ©. 

(b) Con,. Y. 

(c) ‘¥ contains witnesses. 


For this purpose we define symbol sets S,, and sets ®,, of formulas by induction 
on n: 


Sgi=S and Spay) 
Do:=OD and O4,:= 0, WOS,). 
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(Concerning the definition of (S,)* and W(S,), cf. the definitions of S* and 
W(S) preceding 3.4.) 
From the constriction it follows that 


S35 S85 -S 85S wick 
®,< L® forneN, 
®=0,c0,cO,c.... 


We set S’:= ren S, and ¥ :=|Jnen ®,. Then (a) holds. Using 3.4 one can 
easily show Con, ®, by induction on n, and hence by IV.7.7, that Con,,. Se 
Therefore (b) also holds. Finally, ‘¥ contains witnesses: Suppose, forexample, 
AX@E L*’. Then, for a suitable n, 3x € L*°». Thus for some constant c € S,4 ie 


the formula (3x0 — ) is an element of W(S,) and hence an element 
x 
of ¥, C 


PROOF OF LEMMA 3.2. In the proof of 2.2 we made essential use of the count- 
ability of L*, For arbitrary S we no longer have this property at our disposal. 
We resort to Zorn's lemma, which we now state in a form suited to our 
purposes. The reader can find a proof of this lemma in books on set theory. 


Let M be a set and let HU be a nonempty set of subsets of M. % is called a 
chain in UW, if BCU, VF OS, and if for \%, V, € BV we have Vy < V; or 
V, c Vo. Then Zorn's lemma says 


3.5. If for every chain B in U the union |). V belongs to HU, then there is 
at least one maximal element in U, 1.e., an element U, for which there is no 
U,eU such that Uy SU. 


Now, for arbitrary S, let ¥ c LS and Con ¥. Set M :=L* and 
U:= {O|¥ < Oc LS and Con, O}. 


Clearly ‘¥ € U, so U is not empty. Let B be a chain in U.O, = (Jo-g Mis an 
element of U because ¥ c @, c L' and Con, O,.(Theconsistency of ©, can 
be proved as follows: If @, is a finite subset of ©,, say Mp = {@o...-, On}, 
then there are D,,..., ®,_,; € B with g;,¢ ®; for i < n. Since B is a chain, 
we can number the ®; such that Pp © ©, c... < D,_,. Thus @ — ®,_ 1, 
and by Con, ®,-, we have Con, 5.) 


Now we can apply Zorn's lemma (3.5) to U, thereby obtaining a maximal 
element © in U. From the definition of U we know that '¥ < @ < L‘ and 
Con, ©. On the other hand © is also maximally consistent. For if g¢ L* and 
Con, © u {¢}, then © u {~} e€ U; but since @ is maximal, O = © u {9}, in 
other words, gE ©. LC] 
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§4. The Completeness Theorem 


As already mentioned in the introduction to this chapter, we can obtain 
the completeness of the sequent calculus from 2.3 (for at most countable S) 
and from 3.3 (for arbitrary S): 


4.1 Completeness Theorem. For ®c L‘ and geL', if OE @g, then 
Or 5 @. L] 


From 4.1 together with the theorem on correctness (IV.6.2) we have: 
For®eL* and geL’, Of @ iff Orsg, 
and from 3.3 and IV.7.5 we obtain: 
For ® c L’, Sat ® iff Con, ®, 


In III.5 we saw that the concepts of consequence and of satisfiability are 
actually independent of the particular choice of S, It follows from the results 
above that the concepts of derivability and consistency are also independent 
of S (cf. the footnote on page 74). Thus we can simply write “} ” and 
"Con", omitting the subscript. 


4.2 Theorem on the Adequacy of the Sequent Calculus. 


(a) DE piff DE @. 
(b) Sat ® if Con ®. 


4.3 Exercise (cf. exercise [I].2.1), If one transfers the rules (Ass), (Ant), (PC), 
(Ctr), (v A), and (v S) from the sequent calculus to the language of proposi- 
tional calculus (with propositional variables po, p;, p2,... and connectives 
1 and v), one obtains a sequent calculus for propositional logic. For a set 
® u {a} of formulas in propositional logic let ® + « have a definition similar 
to the definition for first-order logic. Further, write ® = a if for every assign- 
ment s such that B{s] = T for all B € @ also af[s]| = T. 

Prove the completeness theorem (and the correctness theorem) for 
propositional logic: 


OeEa iff DE a. 


Historical Note. The completeness theorem is due to Gédel [11]. The 
program of setting up a calculus of reasoning was first formulated and 
pursued by Leibniz, although traces of it may be found in the works of 
earlier philosophers (e.g., Aristotle and Lull). At the beginning of this 
century, Russell and Whitehead developed a calculus, and within it, gave 
formal proofs for a large number of mathematical theorems. Then in 1928, 
Gédel proved the completeness theorem. The method of proof used in this 
section is due to Henkin [13]. 


CHAPTER VI 


The Lowenheim-Skolem Theorem and 
the Compactness Theorem 


The equivalences of } and & and of Con and Sat, respectively, form a bridge 
between syntax and semantics which allows us to transfer properties of + 
to and of Con to Sat. In this way we shall prove several important results 
concerning — and Sat, and at the same time we shall acquire a deeper insight 
into the expressive power of first-order languages. 


1 The Lowenheim-Skolem Theorem 


The domain of the model 3, defined in V.1 consists of equivalence classes 
of terms. We use this fact to obtain the following theorem: 


1.1 Lowenheim-Skolem Theorem. Every satisfiable and at most countable 
set of formulas is satisfiable over a domain which is at most countable. 


ProoF. First let ® be an at most countable set of S-sentences which is satis- 
fiable and hence consistent. Since each S-formula contains only finitely 
many S-symbols, there are at most countably many S-symbols in ®, Therefore 
we may assume, without loss of generality, that S itself is at most countable. 
Since Sat ® holds, so does Con ®, and the proofs in V.1 and V.2 show that 
there is an interpretation which satisfies ® and whose domain A consists of 
classes 7 of terms, where ¢ ranges over T°. Because T® is countable (cf. 11.3.3), 
A is at most countable. This argument can easily be transferred from sets of 
sentences to sets of formulas; for, if ® is a set of S-formulas and 


W = jy nenveLin ol, 
* 1 


eee n- 
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where Cg, Cy,-.., are new constants, then ® and ©’ are satisfiable over the 
same domains (cf. the proof of V.2.4). LJ 


Vx Vy x = y is a sentence which has only finite models. For a unary 
function symbol f, the sentence Vx Vy (fx = fy>x =y)a TVxdyfy =x 
has only infinite models (since there is no function on a finite set which is 
injective but not surjective). 

If one re-examines the proof of the completeness theorem for the case of 
uncountable symbol sets, one obtains the following generalization of 1.1, 
which we formulate for readers who are familiar with the concept of 
cardinality : 


1.2 Downward Léwenheim—Skolem Theorem. Every satisfiable set of formulas 
® < L’ is satisfiable over a domain of cardinality not greater than the cardi- 
nality of L, LI 


In 1.1 (and 1.2) a certain weakness of first-order languages is already 
apparent. In the case of the symbol set S;,, for example, there cannot exist 
a set ® of sentences which characterizes the ordered field R* = 
(R, +,-,0, 1, <) up to isomorphism (in the sense that exactly * and the 
structures isomorphic to ‘R* are the models of ®), Any such set ® of S;- 
sentences would be at most countable and satisfiable (since R“ — ® must 
hold); then by 1.1 there would be an at most countable structure YI such that 
Y — OD, But this could not be isomorphic to R* since R is uncountable. 

In analysis  * is characterized up to isomorphism, say, by the axioms for 
ordered fields and the axiom on Dedekind cuts. Since the former can be 
formulated in L°*", we see that the axiom on Dedekind cuts cannot be 
phrased in terms of S;;-formulas. 


1.3 Exercise. If ® is an at most countable, satisfiable set of formulas and if 
the equality symbol does not occur in any formula of ®, then ® is satisfiable 
over a countable domain. (Hint: In the proof of Henkin’s theorem use the 
set of terms instead of the set of classes of terms as the domain of Sy.) 


1.4 Exercise. Show that every at most countable set of formulas which is 
satisfiable over an infinite domain is satisfiable over a countable domain. 


§2. The Compactness Theorem 
From the definition of F and Con we obtained directly (cf. IV.6.1 and 
IV.7.4): 


(a) ME @ iff there is a finite Py c ® such that O | ¢. 
(b) Con ® iff for all finite Pp < ®, Con Py. 
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Using the adequacy theorem V.4.2 we can rephrase these results for the 
corresponding semantic concepts: 


2.1 Compactness Theorem 


(a) (,forthe consequence relation) 
Oe iff there isa finite Dy < © such that Do & ¢. 
(b) (,for satisfiability) 
Sat ® iff forall finite Dy c P, Sat Bo. 


The compactness theorem is so called because, in terms of a suitable 
topological reformulation, it says that a certain topology is compact (cf. 
Exercise 2.5). 

Wenow usethecompactness theorem to obtain variantsof the L6wenheim— 
Skolem theorem. 


2.2 Theorem. Let ® be a set of formulas which is satisfiable over arbitrarily 
large finite domains (i.e., for every n & N there is an interpretation satisfying 
® whose domain is finite and has at least n elements). Then ® is also satisjiable 
over an injinite domain. 


PROOF. Let 
Yi=@ u {@s,|2 <n} 


(Pn was introduced in III.6.3). Every interpretation which satisfies ‘¥ is a 
model of ® and has an infinite domain. Therefore we need only prove that 


is satisfiable. By the compactness theorem it is sufficient to show that every 
finite subset ‘Wo of ¥ is satisfiable. For each such ‘Wg there is an Hg € N such 
that 


(*) Fo SP u t{Ponl2 Sn T no}. 


According to the hypothesis of the theorem there is an interpretation 3 
which satisfies P and whose domain contains at least ng elements. By (*), 
3 is also a model of Wo. C 


2.3. Upward Liiwenheim-Skolem Theorem. Let ® be a set of formulas which 
is satisfiable over an injinite domain. Then for every set A there is a model of 
® which contains at least as many elements as A. (We say that M has at least 
as many elements as A if there exists an injective map of A into M.) 


Proor. Let ® c L*®. For each ae A let ¢, be a new constant (ie., c, ¢ S) 
such that c, # ¢, for a # b. First, we show that the set 


Y= @v {7A¢, = ¢]|a, be A, a # b} 


of Su {c,|a¢A)-formulas is satisfiable. Because of the compactness 
theorem we can restrict ourselves to showing, for every finite n-tuple of 
distinct elements @g,..., @,-1 € A, that 


f.1.\ Misi fae =rliteni tit 
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is satisfiable (cf. the argument in the previous proof). By hypothesis, there is 
an S-interpretation 3 = (23, 8) which satisfies ® and whose domain is 
infinite. Therefore there are n distinct elements by,...,6,-1 in B. We let 
ca; = b; for i < n. Then the interpretation ((23, cR,..., cB_,), B) satisfies the 
set (+). Since every finite subset of ‘¥ is satisfiable, we can find an inter- 
pretation 3’ which satisfies ‘¥ and hence also @. Let D be the domain of 3’. 
For a, b€ A such that a #5 we have 3’t= 4c, =c,. Hence 3(c,) and 
<3(c,) are distinct elements of D. Therefore the map n: A > D, where x(a) = 


~3(c,), is injective. Thus D has at least as many elements as A. LJ 


The same idea is used in the proof of the following theorem, which we 
state here for readers familiar with the concept of cardinality. 


2.4 Theorem of Lowenheim, Skolem, and Tarski. Let © be a set of formulas 
which is satisfiable over an injinite domain and let « be an injinite cardinal 
greater than or equal to the cardinality of ®. Then ® has amodel of cardinality k. 


ProoF. Let ® and t be given as in the statement of the theorem. Let A be a 
set of cardinality kx. We may assume that ® c L* for a symbol set § of 
cardinality < ti. Then the symbol set S u {c,]ae€ A} given in the proof of 
2.3 has cardinality ti as does the set of S u {c,|a € A)-formulas. Again, let 
Y= @u {7Ac, = ¢,|a,b€ A,a # b}. By 1.2 there is a model 3’ of ‘¥ (and 
hence also of ®) whose domain D has cardinality <ti. On the other hand, 
since 1¢, = c,€P for distinct a, be A, D has cardinality > x; hence its 
cardinality is exactly x, CJ 


2.5 Exercise. Let S be a symbol set. For every satisfiable set ® of S-sentences 
let We be an S-structure such that YW - ®, Further, write 

L := {A_|O c L3, Sat P}, 
and for every S-sentence @ set X, := {We L/W E og}. 


(a) Show that the system {X, |g € L}} is basis for a topology on =. 

(b) Show that every set X, is closed. 

(c) Use the compactness theorem to show that every open covering of & has 
a finite subcovering, so that & is (quasi-)compact. 


§3. Elementary Classes 


For a set ® of S-sentences we call 
Mod, © := {W/W is an S-structure and Qf = O} 


the class of models of ®. Instead of “Mods{g}” we sometimes write 
“Mods; ep”. 
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3.1 Definition. Let & be a class of S-structures. 


(a) ® is called elementary iff there is an S-sentence g such that 8 = Mods @. 
(b) ® is called A-elementary iff there is a set ® of S-sentences such that 
R = Mod, ®, 


Every elementary class is A-elementary. Conversely, because 


gpe® 


every A-elementary class is the intersection of elementary classes. 


From an algebraic point of view we can formulate the question of the 
expressive power of first-order languages as follows: Which classes of 
structures are elementary or A-elementary, i.e., which classes can be axio- 
matized by a first-order sentence g or by a set ® of first-order sentences? 

Let us give some examples. 


3.2. The class of fields (considered as S,,-structures) and the class ef ordered 
fields (considered as S§,,-structures) are elementary. For example, the first 
class can be represented in the form Mods__ gp, where @; is the conjunction 
of the field axioms in I11.6.5. Similarly, the class of groups, the class of equiv- 
alence structures and the class of partially defined orderings (cf. III.6.4) are 
elementary. 


Let p be a prime number. A field % has characteristic p if 
[8 4eecu 15 = 0% 
— ——-_$_— 
p-times 
that is, if % satisfies the sentence 
p-times 

If there is no prime p for which & has characteristic p, % is said to have 
characteristic 0. For every prime p the field Z/(p) of the integers modulo p 
has characteristic p. The field ‘R of real numbers has characteristic 0. 

Mods, (@r A Xp) is the class of fields € characteristic p. Hence this class 
is elementary. The class of fields of characteristic 0 is A-elementary; it can be 
represented as Mods, ({gf} u {7,|p is prime)). The following considera- 
tion will show that it is not elementary. Let @ be an S,,-sentence which is 
valid in all fields of characteristic 0, that is, 

{Pr} V {1XplP Is prime} F @. 
By the compactness theorem there is an ng (depending on @) such that 


{gr} u {Zp|P is prime, Pp < No} F . 
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Hence ¢ is valid in all fields of characteristic = Ng. 
Thus we have proved: 


3.3 Theorem. An S,,-sentence which is valid in all fields of characteristic 0 is 
valid in all fields whose characteristic is sufficiently large. LI 


We conclude from this that the class of fields of characteristic 0 is not 
elementary, for otherwise, there would have to be a sentence @ which is 
valid precisely in the fields of characteristic 0. 

As an instance of 3.3 one obtains the well-known algebraic result that two 
polynomials p(x) and o(x), whose coefficients are integral multiples of the 
unit element and which are relatively prime over all fields of characteristic 
Q, are also relatively prime over all fields of sufficiently large characteristic. 
In order to verify this, one rewrites the statement that p(x) and o(x) are 
relatively prime as an S,,-sentence. In the case p(x) := 3x” + 1 and o(x) := 
x* — 1 one can take the sentence 


dug Ju, Iwo Jw, 429 5z, 4z2 Vx 
(up + u,-xX)-(Wo tw, x)= A+ 14+ 1)-x-x4l 
A (uo + Uy -X)+ (Zo + 24 °X + 22°X*X) BX-X-x — I) 
A 5uo du, Vx (Up tu,-x)- (1 +14 1)-x-x4+1) =x-x-x-d. 


os 6s 


Here .=x-x-x — 1” stands for 
does not belong to S,, !). 


+ 1 =x-x-x” (the symbol — 


34. The class of Jinite S-structures (for a fixed S), the class of Jinite groups, 
and the class offinite fields are not A-elementary. The proof is simple: If, for 
example, the class of finite fields were of the form Mods, ®, then ® would 
be a set of sentences having arbitrarily large finite models (€.g., the fields of 
the form Z/(p)) but no infinite model. That would contradict 2.2. CI 


On the other hand, exercise 3.6 below shows that the corresponding 
classes of infinite S-structures (groups, fields) are A-elementary. 


35. The class of torsion groups is not A-elementary. We give an indirect 
proof, assuming (for a suitable set ® of S,,-sentences) Mods, ® to be the 
class of torsion groups. Let 
Y= OM {[4x0e---ox seln> I}. 
en 


n-times 
Every finite subset ‘Wy of YY has a model: Choose an ng such that 


Po << Ou {7xe ox Sell <n< no}; 


n-times 
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then every cyclic group of order no is a model of ‘Vo if x is interpreted by a 
generating element. Now let (G, £8) be a model of Y¥. Then f(x) does not 
have finite order, showing that 6 is a model of ® but not a torsion group. 


3.6 Exercise. Let R be a A-elementary class of structures. Show that the 
class R™ of structures in 8 with infinite domain is also A-elementary. 


3.7 Exercise. If & is a class of S-structures, ® < L} and R = Mods 9, then 
® is said to be a system of axioms for R. Show: 


(a) St is elementary if and only if there is a finite system of axioms for R. 
(b) If 8 is elementary and 8 = Mod, ® then there is a finite subset Dy of 
® such that R = Mod, Do. 


3.8 Exercise. A set ® of S-sentences is called independent if no pe @ is a 
consequence of ® — {g}. Show: 


(a) Every finite set ® of S-sentences has an independent subset Mp such that 
Mod, ® = Mod, Qo. 

(b) If S is at most countable then every A-elementary class of S-structures 
has an independent system of axioms. (Hint: Start by defining an axiom 
system Mo, M1, @2,-.. such that Fg;., > g; forie N.) 


3.9 Exercise. Let ® be the finite system of axioms for vector spaces expressed 
in terms of S = {F, V, +, -,0, 1, ®, e, *} (cf. IfL7.2). Show: 


(a) For every n the class of n-dimensional vector spaces is elementary. 
(b) The class of infinite-dimensional vector spaces is A-elementary. 
(c) The class of finite-dimensional vector spaces is not A-elementary. 


§4. Elementarily Equivalent Structures 


We begin by introducing two new concepts. 


4.1 Definition. (a) Two S-structures Wand 23 are called elementarily eguiv- 


alent (written: YW = 23) if for every S-sentence @ we have W t= iff 23 & ¢. 
(b) For an S-structure WF let Th(Q) = {@ € L3 |W -= —}. Th(QW) is called the 
(first-order) theory of YW. 


4.2 Lemma. For S-structures YW and 23, 


B=W if 23 Th). 
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Proor. If B = YW then, since Wt Th(Q), also Be Th(YL). Conversely, if 
Be Th(W) then, given an S-sentence @, we examine the two possibilities: 
(i) If We @ then me Th(W) and hence Be @. (ii) If not We ~, then 
—o € Th(W); thus Be 4@and therefore not BE @. L 


In the following let 20 be a fixed S-structure. We consider 


(1) the class {8|% = 2} of structures isomorphic to YW; 
(2) the class of structures which satisfy the same sentences as YI, Le., the 
class {8|8 = YW} of structures elementarily equivalent to YI. 


From the isomorphism lemma II1.5.5 it follows directly that isomorphic 
structures are elementarily equivalent, that is, 


(+) {B|B x Wc {B/B = W. 


4.3 Theorem. (a) /f Wi is infinite then the class (23 (23% } is not A-elementary; 
in other words, no injinite structure can be characterized up to isomorphism 
in afirst-order language. 

(b) The class {8|8B =} is A-elementary; in fact {B}/B= UW} = 
Mod, Th(Y). Moreover, {8/8 = MY} is the smallest A-elementary class 
which contains YI. 


From 4.3 together with (+) we obtain the result that for infinite Yl the 
class @3|8 = W} must be a proper subclass of @3 |B = YW}; in particular: 


4.4 Corollary. For each infinite structure there exists an elementarily equiv- 
alent, nonisomorphic structure. 


PROOF OF 4.3. (a) We assume YI to be infinite and ® to bea set of S-sentences 
such that 


(x) Mod, ® = {8|B ~ W. 


® has an infinite model, and hence by 2.3, it has a model B with at least as 
many elements as the power set of YW. Hence B is not isomorphic to Y, in 
contradiction to (4). 

(b) From 4.2 it follows immediately that {8} = W} = Mods Th(Q). 
Now, if Mods ® is another A-elementary class containing YW, then We O 
and therefore 2B t= © for every B such that B = YW; hence {Bi B= Wh 
Mod, ®. al 


4.3(b) shows that a A-elementary class contains, together with any given 
structure all elementarily equivalent ones. In certain cases one can use this 
fact to show that a class & is not A-elementary. To do this one simply specifies 
two elementarily equivalent structures, one of which belongs to 8, and the 
other does not. We illustrate this method in the case of archimedean fields. 
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An ordered field % is called archimedean if for every a €F there is a 
natural number n such that 


a<PiPa... 4%, 
—_——_——_—_—— 
n-times 


For example, the ordered field of rational numbers and the ordered field R* 
of real numbers are archimedean. We show that there is an ordered field 
elementarily equivalent to R* which is not archimedean. Then we shall 
have obtained : 


4.5 Theorem. The class of archimedean fields is not A-elementary. 


PROOF. Let 
Y = THOR) u {(Q< x,1<x,2<x,...}, 


where 0, 1, 2,... stand for the S,,-terms 0, 1, 1 + 1,.... Every finite subset of 
Y is satisfiable, for example, by an interpretation of the form (‘R~, f), where 
B(x) is a sufficiently large natural number. By the compactness theorem 
there is a model (23, fp’) of ‘P. Since B f= Th(R*), B is an ordered field 
elementarily equivalent to K*, but (as shown by the element f’(x)) it is not 
archimedean. 


The application of the compactness theorem in the preceding proof is 
typical and has already been used several times (cf. 2.3, 3.5). In each case the 
problem consists in finding a structure with certain properties which can 
be expressed in first-order language by means of a suitable set ‘¥ of formulas. 
To prove satisfiability of ‘¥ one employs the compactness theorem. In the 
preceding proof ¥ contains (in addition to Th(R*)) formulas which 
guarantee that there is an element which violates the archimedean ordering 
property. The compactness theorem says in this case that, from the existence 
of ordered fields with arbitrarily large "finite'’ elements, one can conclude 
the existence of an ordered field with an "infinitely large'' element. We shall 
give some further examples. 

The axiom system II from III.7.5 characterizes the structure Jt up to 
isomorphism. However, 93t cannot be characterized up to isomorphism by 
means of first-order formulas (cf. 4.4). Hence the induction axiom, being the 
only second-order axiom of II, cannot be formulated as a first-order formula 
or as a Set of first-order formulas. 

A structure which is elementarily equivalent, but not isomorphic, to 
¥t is called a nonstandard model of arithmetic. The proof of 2.3 shows that 
there exists an uncountable nonstandard model of arithmetic. We now prove: 


4.6 Skolem's Theorem. There is a countable nonstandard model of arithmetic. 


PROOF. Let 
Y:= Thy) u {ix=hi x =lWnx = wy. }. 


§4. Elementarily Equivalent Structures 97 


Every finite subset of ‘¥ has a model of the form (i, £), where f(x) is a 
sufficiently large natural number. By the conipactness theorem there is a 
model (QW, 8) of ‘¥, which by the Lowenheim-Skolem theorem we may 
assume to be at most countable. YI is a structure elementarily equivalent to 
Yt. Since for m # n the sentence 4m = n belongs to Th(Jt), Y is infinite and 
hence it is countable. 3t and Yf are not isomorphic, since an isomorphism 2 
from tonto YF would have to mapn = n™ to n™ (cf. (i) in the proof of III.5.5), 
and thus f(x) would not lie in the image of z. LI 


Considering the set Th(93t*)U {ix =O, i x =1, i x =2,...), we 
obtain analogously: 


4.7 Theorem. There is a countable structure elementarily equivalent to Kt* 
which is not isomorphic to Nt .(Inother words, there is a countable nonstandard 


model of Th(3t*).) C] 


What do nonstandard models of Th($3t) or Th(Yt*) look like? In the 
following we gain some insight into the order structure of a nonstandard 
model Y of Th(9*) (and hence also into the structure of a nonstandard 
model of Tht); cf. exercise 4.9). 

In $t* the sentences 


¥x(Q = x vO < x), 
O<1aVWx0O<x7~1l=xvi <x), 
1<2an Vx <x72=xv2<>x),... 
hold. They say that 0 is the smallest element, | the next smallest element after 


QO, 2 the next smallest element after 1, and so on. Since these sentences also 
hold in YT, the "initial segment" of Wf looks as follows: 


In addition, 2{ contains a further element, say a, since otherwise % and Yt 
would be isomorphic. Furthermore, 9¢ and hence YI satisfy a sentence @ 
which says that for every element there is an immediate successor and for 
every element other than Q there is an immediate predecessor. From this it 
follows easily that A contains, in addition to a, infinitely many other elements 
which together with a are ordered by <4 like the integers: 


O* 14+ DA a 
If we consider the element a + 4a we are led to further elements of A: 


+——+——_+--—+ --- +++ <«<—__}—_ >... + -<____|—__> . - - . 
Of 44 «24 a a+“a 
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The reader should give a proof of this and also verify that between every two 
copies of (Z, <“) there lies another copy. 

The examples in this and the previous section show that there are impor- 
tant classes of structures which cannot be axiomatized in a first-order 
language. On the other hand, this weakness of expressive power also has 
agreeable consequences. For example, the argument establishing that the 
class of archimedean fields is not axiomatizable yields a proof of the existence 
of non-archimedean ordered fields; and the fact that the class of fields of 
characteristic Q cannot be axiomatized by means of a single S,,-sentence is 
complemented by the interesting result 3.3. Using similar methods, one can 
obtain structures elementarily equivalent to the ordered field ‘R~ of real 
numbers which contain, in addition to the real numbers, infinitely large 
elements and infinitely small positive elements (infinitesimals). Such struc- 
tures can be used in a development of analysis which avoids the é — 6- 
technique (nonstandard analysis; cf. [14], [23]), Thus we see that the first- 
order languages turn out to bea useful tool in various areas of mathematics. 
Semantic investigations of this kind belong to the subject called model 
theory. We refer the reader to [4] for further information. 


4.8 Exercise. Show that a sentence which is valid in all non-archimedean 
ordered fields is valid in all ordered fields. 


4.9 Exercise. Let the S,,-structure WI be a model of Th($t). Let the binary 
relation <4 be defined as follows: 


For all a, be A, a <“b iffa ¥ band there is c€ A such that 
at4c=b. 


Show that (WI, <“) is a model of Th(%*). 


4.10 Exercise. If 2{ is a model of arithmetic (that is, Wt Th(9t)) and if 
a, be A, then a is said to be a divisor of b (written: a}b) if a 4c = b fora 
suitable c € A. Let Q be a set of prime numbers. Show that there is a model 
Yl of arithmetic which contains an element a whose prime divisors are just 
the members of Q, that is, for every prime p: 


i4+...4+ I4/a iff peg. 
‘eae siteenee’ 
p-times 
Conclude that there are uncountably many pairwise nonisomorphic count- 


able models of arithmetic. 


4.11 Exercise. Let 2 = (A, <“) be a partially defined ordering (cf. ITI.6.4). 
We say that <4 (or also (A, <4)) has an infinite descending chain, if there 
are elements ay, @,,42,... inthefieldof <4 suchthat...<4a, <4 a, <4ap. 
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Show: 


(a) (N, <) contains no infinite descending chain; on the other hand, if A 
is a nonstandard model of Th(9t*<), then (A, <“) contains an infinite 
descending chain. 

(b) Let < € Sand ® < L%. Assume that for every m€ N there is a model YW 
of ® such that (A, <4) is a partially defined ordering and the field of 
<4 contains at least m elements. Then there exists also a model 8 of ® 
such that (B, <¥) is a partially defined ordering containing an infinite 
descending chain. 


CHAPTER VII 
The Scope of First-Order Logic 


In the introductory chapter we realized that investigations into the logical 
reasoning used in mathematics require an analysis of the concepts of mathe- 
matical proposition and proof. In undertaking such an analysis, we were 
led to introduce the first-order languages. Further we defined a notion of 
formal proof which corresponds to the intuitive concept of mathematical 
proof. The completeness theorem then shows that every proposition which 
is mathematically provable from a system of axioms (and thus follows from 
it) can also be obtained by means of a formal proof, provided the proposition 
and the system of axioms admit a first-order formulation. 

In this chapter we discuss what has been achieved so far and what impli- 
cations this has for the foundations of mathematics. To start our discussion 
let us consider the following questions: 


(1) One goal of our investigations was a clarification of the notion of proof. 
However, we carried out mathematical proofs before the notion of proof 
was made precise. Are we not trapped in a vicious circle? Further, even 
if there are no problems of this kind in our approach, how can we then 
justify the rules of the sequent calculus ©? 

(2) We realized, particularly in Chapter VI, that the first-order languages 
have certain deficiencies in expressive power. Hence the question: What 
effect does the restriction to first-order languages have on the scope of 
our investigations? 


We deal with the second question in §2. There we shall see that the first- 
order languages are in principle sufficient for the mathematics of today. 
Hence the following discussion, pertaining to the first question, applies, in 
fact, to the whole of mathematics. 


$1. The Notion of Formal Proof 


1 The Notion of Formal Proof 


In answering question (1), we want to show that no mathematical proofs are 
needed to introduce the notion of formal proof. In our discussion we also 
investigate the nature of the sequent rules and consider possible means of 
justifying them. 

In §2 we shall argue that a finite set S$ of concretely chosen symbols 
suffices to represent the statements and arguments arising in mathematics. 
Therefore in this discussion we can specify the symbols as concrete signs; 
thus terms, formulas, and sequents are concrete strings of symbols and not 
abstract mathematical entities such as are, for example, formulas in a 
language whose symbol set is {c, |r € R} (cf. 11.1). 

The notion of formal proof is based on the manipulation of symbol 
strings such as terms, formulas, and sequents. These manipulations are 
governed by a series of calculi, like the calculus of terms and the sequent 
calculus. The application of rules in these calculi consists of simple syntactic 
operations. We illustrate this in the case of the sequent calculus. To clarify 
the aspect we have in mind let us start by a comparison with the rules of 
chess. 

The rules of chess permit certain operations on concrete objects, the chess 
pieces. Applying a rule, that is, making a move, consists of proceeding from 
one configuration of pieces to another. Each individual rule of chess is so 
simple that everyone who knows the rules—even if he is not a chess player — 
can carry out moves by himself, or can check moves to determine whether 
they were made according to the rules. 

A similar situation pertains in the case of sequent rules. Clearly the rules 
are motivated by the intended meanings (of 74, v, =,...), but their applica- 
tion does not require any knowledge of these meanings: one merely performs 
concrete syntactic operations on strings of symbols. Anyone who knows the 
rules—even if he is not a logician or a mathematician— can apply them and 
can check whether an application has been carried out correctly. Admittedly, 
when dealing with sequents, we have often relied on mathematical proposi- 
tions (for example, we invoked the unique decomposition of a sequent into 
formulas when speaking of the succedent). But this can be avoided if, when 
applying a rule, we not only note the sequent, but also keep a record of how 
the symbol strings in it were obtained. We give some examples. 


(a) Let O, and @, be sequents which occur in a derivation. One reads from 
the record accompanying the derivation that ©, was obtained by forming 
a string from @p,..-., @,, and that @, was obtained similarly from 
Wo,.-.5 W,- If one wants to apply the rule ( v A), for example, one must 
first check whether n = m > 1, and whether the symbol strings @, and 
W; agree for every i # n — 1. If so, one can apply (v A) by forming the 
symbol, string @o... @n-2(@n-1 V Wn-1)@, from the components 
(Po,.++5 Pn—2> Pn—1> Wn—1> Ons (, VY, and ). Moreover, one notes in the 
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record that this symbol string was obtained from the components 
Po>-.+>Pn-25 (Pn-1 BA Wn—1)s and Pn: 

(b) An application of the rule (=) consists of writing down a sequent of the 
form t = t, where the term ¢, for its part, has to be given by means of a 
derivation in the calculus of terms (cf. II.3.1). 

(c) Similarly, when one uses the rule (JA) to proceed from the sequent 


lo " y to the sequent I’ 4xg w, one must supply a derivation of oxy@ " 


in the substitution calculus (cf. III.8.11), and, for every x in I dx@ wy, one 
must supply a derivation of y x in the calculus of nonfree occurrence for 
variables (cf. II.5.2) in order to show that the condition ” yis not free in 


I Axg w” is fulfilled. Then, starting from the sequent Tg " yw, one needs 


only to write down the sequent I ax@ w. 


From these examples it becomes clear that an application of the sequent 
rules consists of purely syntactic manipulations which can be carried out 
without any reference to mathematical arguments. Since, by definition, a 
formal proof is just a sequence consisting of sequents, each of which is 
obtained by an application of a sequent rule to preceding sequents, it is 
obvious from our previous remarks that no mathematical proofs are needed 
in order to introduce the notion of formal proof. Thus our approach is not 
circular. The proofs we have given before defining the notion of formal 
proof, and the mathematical arguments we have used in building up the 
semantics, merely served the purpose of gaining insight into first-order 
languages and of motivating our development. 

However, a word of warning is in order when considering this reduction 
of the notion of proof to a triviality by the calculus of sequents: We have seen 
that no mathematical talent, only patience, is needed to verify a formal proof 
in accordance with the rules; but it is a completely different matter to under- 
stand the idea of a proof, not to speak of developing such ideas oneself. 
Likewise, in chess there is also a great difference between knowing the rules 
and being able to checkmate a skillful opponent. Thus when determining 
the notion of formal proof we did not really touch upon the more creative 
part of mathematical activity (and this includes not only the development of 
proof ideas, but also the introduction of adequate concepts, setting up suitable 
systems of axioms, and finding new interesting conjectures). 

Does our formal notion of proof provide a justification of common 
mathematical reasoning? Certainly not; for we have merely imitated 
methods of proof in the framework of a precisely defined language. However, 
we can at least claim that the sequent rules correspond to the normal usage 
of connectives, quantifiers, and equality in mathematics. For example, the 
v -rules reflect the use of the inclusive "or”, according to which the disjunc- 
tion of two propositions is true if and only if at least one of the propositions 
is true. Admittedly, such usage of "or" rests on certain assumptions; for 
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example, it must be meaningful to speak of the truth or falsehood of a 
mathematical proposition, and every such proposition must be either true 
or false (tertium non datur). In traditional mathematics (which in this 
regard is also called classical mathematics) these assumptions are accepted. 
Thus the rules of the sequent calculus are based upon the classical usage of 
the logical connectives. 

Some mathematicians engaged in foundational questions, among them 
intuitionists, do not share the classical point of view. An intuitionist associates 
with the assertion of a mathematical proposition the requirement that it be 
proved in a"'constructive" way. For instance, an existential statement must 
be proved by presenting an example, and a disjunction must be proved by 
establishing one of its members. To illustrate this we consider the following 
two statements. 


A: Every even number >4 is the sum of two primes (Goldbach's 
conjecture); 


not A: Not every even number 2 4 is the sum of two primes. 


From the classical point of view (A or not A) is true. However, an intuitionist 
cannot assert (A or not A) since neither the proposition A nor the proposition 
(not A) has hitherto been proved (even using classical methods). 

This example already shows that mathematics as pursued by an in- 
tuitionist, the so-called intuitionistic mathematics (cf. [17]), differs consider- 
ably from classical mathematics. Intuitionists investigate "mental mathe- 
matical constructions as such, without reference to questions regarding the 
nature of the constructed objects, such as whether these objects exist inde- 
pendently of our knowledge of them" (cf. [17], p. 1). By contrast, some 
mathematicians adopt the classical point of view from the conviction that 
"the objects in mathematics, together with the mathematical domains, 
exist as such, like the platonic ideas" ([24], p. 1), 1e, that propositions 
concerning these objects describe properties which either do or do not hold, 
and hence are either true or false. 

We see from this discussion that the possibilities for justifying methods 
of mathematical reasoning (and specifically for justifying a proof calculus) 
depend essentially on epistemological assumptions. We shall continue to 
adopt the classical point of view. 


§2. Mathematics Within the Framework of 
First-Order Logic 


In this section we wish to discuss the latter question raised at the beginning 
of the chapter: How serious is the restriction to first-order languages? 

To treat this question we start with the example of arithmetic. In this 
case, the weakness of the expressive power of first-order languages manifests 
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itself in the fact that the structure 9t, = (N, o, 0) (cf. I[I.7.3) cannot be charac- 
terized up to isomorphism in L'%-°!. On the other hand, according to 
Dedekind's theorem, t, can be characterized in a second-order language by 
the Peano axioms (cf. If1.7.4): 


(Pl) Vx m0x =0. 
(P2) Vx Vox = gy > x = y). 
(P3) VX((X0 a Vx(Xx — Xox)) > Vy Xy). 


Let us call a structure which satisfies (P1)-(P3) a Peano structure. Then we 
can formulate Dedekind's theorem as follows: 


2.1. Any two Peano structures are isomorphic. 


Since Peano structures cannot be characterized in the first-order language, 
one might suspect that the result 2.1 cannot be formulated in the framework 
given by first-order logic, and in particular, that its proof in III.7.4, which 
involves (P1)-(P3), cannot be carried out within this framework. Never- 
theless this can be achieved as we now show. 

First let us note that in 2.1 a statement is made about {g¢, 0)-structures. 
We want to interpret 2.1 as a statement about a domain which comprises 
as elements all Peano structures and also with any two such structures an 
isomorphism between them. Furthermore this domain should contain the 
elements and subsets of Peano structures, since these also play a réle in the 
formulation of (P1)-(P3) and in the proof of 2.1. 

In order to avoid drawing arbitrary boundaries and to enable us to 
apply our discussion to other propositions besides 2.1, we shall consider as 
domain the totality of all objects which are treated in mathematics; this we 
shall call the (mathematical) universe. The universe contains not only 
"simple" objects, such as the natural numbers or the points of the euclidean 
plane, but also" more complicated ” objects, such as sets, functions, structures, 
or topological spaces. A mathematician assumes in his arguments that this 
universe has certain properties: for example, that for every two objects a 
and a, the set /a,,a,) exists, likewise for any two sets M,, M, the union 
M, u M,j, and for every injective function f the inverse f~'. Mathematical 
statements can then be regarded as propositions about the universe. From 
this point of view, 2.1 says that for every two Peano structures YW and 23 in 
the universe there is another object in the universe which is an isomorphism 
between YW and 23. 

Now it is possible to present in a suitable first-order language a rather 
simple set of sentences expressing all the properties of the universe which 
mathematicians use. Proposition 2.1 can also be formalized in this language. 
In other words, 2.1 can be formalized as a proposition about the universe 
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in a first-order language L* appropriate to the universe, just as the proposi- 
tion "there is no largest real number" can be formalized as a proposition 
about the structure (R, <™) in the language L'“! appropriate to (R, <*). 

Wecarry out some steps of this idea more carefully: A preliminary analysis 
of the totality of mathematical objects leads us to a symbol set which is 
suitable for the universe. In a second step we present parts of a system Po 
of axioms which express those properties of the universe used in mathematics. 
(A complete presentation of such a system ®, follows in §3.) Finally, we 
indicate how to obtain a first-order formalization of 2.1. 

When introducing the universe, we spoke of "simple" objects (numbers, 
points,...) and "complex" objects (sets, functions, ...). For the sake of 
simplicity we make use of the empirical fact that the whole spectrum of 
"complex" objects can be reduced to the concept of set. (We shall carry out 
this reduction for ordered pairs and functions.) We call the ''simple" objects 
urelements. Thus, the universe contains only urelements and sets. The sets 
consist of elements which are either urelements or else sets themselves. 
Therefore Po essentially collects basic properties of sets and hence is called 
a system of axioms for set theory. 

We use the unary relation symbols U (“... is an urelement") and M 
(“... is a set") to distinguish between urelements and sets, and we use the 
binary relation symbol € for the relation "is an element of...” . Thus we are 
led to the symbol set S:= {U, M, €}. 

Now we give four axioms from ®, which formalize simple properties of 
the universe. 


(AO) ¥x(Ux v Mx) 
"Every object is an urelement or a set”. 


(Al) ¥x 7(Ux a Mx) 
"No object is both an urelement and a set”. 


(A2) Vx Vy((Mx a My a VozE€x@zey))>x =y) 
"Two sets which contain the same elements are equal". 


(A3) Vx Vy dz(Mza VutuezoU=xvuz y))) 
"For every two objects x and y, the pair set {x, y} exists". 


The set z, whose existence is guaranteed by (A3), is uniquely determined 
by (A2). Repeated application of (A3) yields the existence of the set {{x, x), 
{x, y}}. This set is normally written (x, y) and called the ordered pair of x 
and y. It is not difficult to show from (AQ)-(A3) that 


(x,y)=(',y') iff x=x' and y=y¥. 
Ordered triples can then be introduced by 


(x, y, Z) = ((x, y), z). 
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In order to obtain formalizations which are easier to read, we introduce a 
number of abbreviations. 


(<c)xcy forMx a My a V2(zEx >zE€y). 
("xis a subset of y”) 


(Instead of treating "x © y" as an abbreviation for a formula of L* we could 
have added the binary relation symbol < to S and expanded ®, by adding 
the axiom 


Vx V(x S yo (Mx a My a Ve(zEex- ze y))). 
Both approaches are equivalent, as we shall see in VIIL!.) 


(OP) OPzxy for Mz 4 Vuluez(Mu a (Vo(vEuov=x) 
Vv VoL Eu o(v = xv v=y))))) 
("zis the ordered pair of x and y”) 


(OT) OTuxyz for Mu a 4v(OPuvz a OPvxy) 
(“u is the ordered triple (x, ), z) as defined above"’) 


(E) Euxy for Mu a 42(z€u a QPzxy) 
("The ordered pair (x, y) is an element of 11”) 


(F) Fu for Mu a V2(z €u — 4x SyOPzxy) a 
Vx Vy Vy'((Euxy a Euxy’) > y = y’) 
("wis a function, that is, a set of ordered pairs (x,y), where y is the 
value of u at x”). 


By means of (F) the concept of function is reduced in the usual manner to 
that of set: a function f: A — B is considered as the set {(x,f(x)]x€ A), 
which is also referred to as the graph of f. 


(D) Duv for Eu a Mt a Yx(x Evo JyEuxy) 
("uis the domain of the function u”) 


(R) Ruv forFu a Mv a Vy(y €v Ax Euxy) 
(“v is the range of the function uw”). 


For simplicity we regard a {g,0)-structure as an ordered triple (x,y, z) 
consisting of a set x, a function y: x > x, and an element z of x. Then the 
following abbreviation “ PSu” expresses that u is a Peano structure, whereby 
parts (1), (2), and (3) are formulations of the Peano axioms (P1), (P2), and 
(P3), respectively. 


(PS) PSu_ for 4x dy 32@OTuxyz aA Mx azex 
A Ey a Dyx a dv(Ryv avex)ja 
(1) VwoEywz a 
(2) Vw Vw" Vo((Eywo a Eyw'v) > w = w') A 
(3) VxX'(x Cx AzEX 
AYVwVu((we x’ A Eywe) > ve x'))> x' =»). 
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The final abbreviation “fwuu'” states the property that w is an isomorphism 
of the Peano structure u onto the Peano structure w’: 


(1) Iwuu' for PSu a PSu' a Fw 
A 4x dy 3z ax’ dy’ 32z’(OTuxyz a QTu'x'y'z' 
A Dwx A Rwx’ 
A Vrs Vu((Ewre’ a Ewsv') > r = s) 
A Ewzz' a Vou Vr((Eyor a Ewor’) > Sr'(Ewrr’ a Ey'v'r’))). 


Thus the following is a formalization of 2.1 
(+) Vu Vo(PSu a PSv > 4wl wut). 


Clearly, (+) is a {U, M,5)-sentence. So we have attained our goal of 
formulating 2.1 within a first-order language. This was possible because we 
did not distinguish between different types of mathematical objects, such 
as natural numbers and sets of natural numbers, but simply treated all 
objects in the universe as first-order ones (compare (P3) and (3) in (PS)). 
We can achieve even more: Recall that the system Py, (which we have given 
only in part) captures all properties of the universe needed for mathematical 
reasoning. By rewriting in L> the proof of Dedekind's theorem 2.1 (cf. III.7), 
onecan obtain aproof that leads from axioms of ®, to the assertion (+) using 
only sequent rules. Hence we have: 


2.2. Do H Vu Vu(PSu a PSv — 4wlwuv). 


This procedure can be generalized: 

Experience shows that all mathematical propositions can be formalized in 
LS (or in variants of it), and that mathematically provable propositions have 
formalizations which are derivable from Dy. Thus it is in principle possible to 
imitate all mathematical reasoning in L* using the rules of the sequent calculus. 
In this sense, first-order logic is sufficient for mathematics. At the same time 
this experience shows that the properties of the universe which are expressed 
in Dy are a sufficient basis for a set-theoretic development of mathematics. 
Thus ®, is aformalization of the set-theoretic assumptions about the universe 
upon which the mathematician ultimately relies. Since these set-theoretic 
assumptions can be viewed as the background for all mathematical con- 
siderations, we call @ , in this connection, a system of axioms for background 
set theory. 

On the other hand, ®, itself, like any other system of axioms, can also 
be the object of mathematical investigations. For example, one can ask 
whether ®, is consistent or study the models of ®), Such a model has the 
form W& = (A, U*, M*, €4) and is, like every structure, an object of the 
universe, that is, an object in the sense of background set theory. The same 
is true of the domain A. Thus as an object of the universe, A is distinct from 
the universe. (In particular the universe is not the domain of a model of ®y.) 
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Nevertheless, in a model W = (A, U*, M4, €*) of po, all set-theoretical 
statements hold which are derivable from ®,; but note that, for example, 
a €‘ b (for a, b € A) does not mean that a is an element of b, 1.¢., that a € b 
holds. 

Let us emphasize once again that Py plays two réles: It is both an object 
of mathematical investigations and a formalized description of basic proper- 
ties of the universe. In other words, it is both a mathematical object and a 
framework for mathematics. 

Thus we have two levels, "object set theory" and "background set theory", 
which must be carefully distinguished. Many paradoxes arise from a con- 
fusion of these two levels. In §4 we shall discuss this in more detail. For the 
present we merely mention Skolem's paradox. It is well known that there are 
uncountably many sets (for example, there are uncountably many subsets of 
N). This fact can be formalized by a sentence ¢, which is derivable from ®q. 
By the Lowenheim-Skolem theorem there is a countable model Y of Do 
and hence of ~. The countable model YF thus satisfies a sentence which says 
that there are uncountably many sets in WU! 


§3. The Zermelo—Fraenkel Axioms for 
Set Theory 


We now present in full a system of axioms for set theory. For a more detailed 
exposition we refer the reader to [8] or [9]. 

In §2 we assumed that the universe consists only of sets and urelements, 
and we saw with the help of set-theoretic definitions for concepts such as 
"ordered pair" and "function" that this assumption is really no restriction. 
Furthermore, experience has shown that one can even replace the urelements 
arising in mathematics by suitable sets. Hence in what follows we shall 
assume that the universe consists only of sets. Later, as an example, we shall 
give a set-theoretic substitute for the natural numbers. 

Since we are abandoning the use of urelements, the symbols U and M 
become superfluous. Therefore we formulate the axioms in L‘!, where the 
variables are intended to range over the sets of the universe. The resulting 
system of axioms, called ZFC, is originally due to Zermelo and Fraenkel, 
and includes the axiom of choice. 

ZFC contains the axioms EXT (the axiom of extensionality), PAIR (the 
pair set axiom), SUM (the sum set axiom), POW (the power set axiom), INF 
(the axiom of infinity), AC (the axiom of choice), and the axiom schemes SEP 
(separation axioms) and REP (replacement axioms).' 


EXT: Vx Vy(VzzExeozey) rex =y). 
(Two sets which contain the same elements are equal.) 


? Often one also includes the so-called axiom of regularity. 
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SEP: For each 9(z,X,...,X,-1)° and arbitrary distinct variables x, y 
which are also distinct from z and the x;, the axiom 


Vx... WXq—1 Vx dy W2(z E YO(ZEX A OlZ, X0,-.., Xn-1))). 


(Given a set x and a property P which can be formulated by an 
{€}-formula, the set {z € x|z has the property P) exists.) 


PAIR: Vx Vy dz Vw(w ez (WwW = x V wW=Yy)). 
(Given two sets x, y, the pair set {X, y} exists.) 


SUM: Vx dy V2(zE yo dw(wex A zZ€w)). 
(Given a set x, the union of all sets in x exists.) 


POW: Vx dy V2(z € yo Vw(w €z > wEx)). 
(Given a set x, the power set of x exists.) 


In order to formulate the remaining axioms more conveniently, we in- 
troduce some defined symbols. The considerations in VIII.1 show that 
formulas which contain these symbols can be regarded as abbreviations of 
{€}-formulas. The symbols and their definitions are: 


© (constant for the empty set): 
VAS = yo Vz TZEy). 
c (relation symbol for the subset relation): 
Vx W(x C yo V2(zEx > ZEY)). 
{, } function symbol for pairing): 
Vx Vy Vz(ix, y) = zo Vw(w €zO(Ww = x V w= y))). 


(For the term {y, y} we often write the shorter form {y}.) 
v (function symbol for the union): 


Vx Vy Vax Uy = zo Vow EZ (WEX v WEY))). 
n (function symbol for the intersection): 
Vx Vy Vex Oy = zeVw(wezeo(wex a we y))). 
P (function symbol for the power set operation): 
Vx Wy(Px yoVe(zEyozcx)). 
The remaining axioms of ZFC are as follows: 


INF: Ax(OeEx a VWyEex yu ly} Ex)). 
(There exists an infinite set, namely a set containing @, {@}, 


(DAS ay 


? Here and in the following we write W(Vo,.--»¥n—1) to indicate that the variables occurring 

free in are among the distinct variables yo,..-, ¥a—1- 

3 At a first glance it might be more natural to demand that there exists a set containing @, {}. 
Qa). y- This is Zermelo's original version. However, our formulation of INF (due to von 

Neumann) has become customary because of numerous advantages. 
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REP: For each (x, y, X9,---,X,—1) in L'@ and all distinct variables u, v 
which are also distinct from x, y, X9,.-.; X,-1 the axiom 


VX iene Vi (Vx a "ye(x, Vs XQs-++5 Xn-1) 
— Vu dv Vy(y € v > Axx EU A OLX, Y, Xo. -++s Xn-1)))). 


(If for parameters Xg,...,X,-, the formula @(x, y, Xo,...5 Xn-1) 
defines a map x > y, then the image of a set is again a set.) 


AC: Vx(7G ex a VWuVo(uex avEexa Tu=v)7unve= @)) 
> dyVw(wex 737 'zzewny)). 


(Given a set x of nonempty pairwise disjoint sets, there exists a set 
which contains exactly one element of each set in x.) 


Within the framework of ZFC one can now introduce the notions of 
ordered pair, ordered triple, function, etc. as we did in the preceding section. 
Moreover, as already mentioned above, experience shows that ZFC also 
permits one to substitute suitable sets for urelements, as we demonstrate 
below in the case of the natural numbers. Thus the insight stated (for ®)) in 
the previous section also applies to ZFC: All mathematical propositions can 
be formalized in L'*', and provable propositions correspond to sentences 
derivable from ZFC. 

We now effect a set-theoretical substitute for the natural numbers. 
Moreover in our present framework we exhibit a Peano structure which 
can play the role of Si,. 

The sets 0 :=@, 1 :={@}, 2:={@, {P|} },... will play the réle of the 
natural numbers 0, 1, 2,.... Thus 0= @, 1 = {Q}, 2 = {0,1}, and in 
general n = {Q,1,..., n— 1). Let us call a set inductive if it contains @, and 
if whenever it contains x it also contains x u {x); then the smallest inductive 
set assumes the r6le of Jt, It remains to show that the statement "there is a 
smallest inductive set" is derivable in ZFC. We give a guideline as to how to 
proceed. By INF there exists an inductive set, say x. Using SEP we obtain 
the set 


w:= {z|z € x and for all inductive y, z€ y), 


which can be shown to be the smallest inductive set. The function v: w — o, 
where w(x) = x u {x) for x € @ (ie., the function v = {(x,x u {x})|x€w)) 
plays the réle of the successor function. One can then see that (w, v, Q) is a 
Peano structure. 

We close our presentation of ZFC with an important methodological 
aspect by briefly discussing the so-called continuum hypothesis. This hypo- 
thesis was stated at the end of the nineteenth century by G. Cantor and has 
had a crucial influence on the development of set theory. 

Two sets x, yare said to bed the same cardinality (written: x ~ y) if there 
is a bijection fromx toy. A set is finite if and only if it is of the same cardinality 
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as an element of ; it is countable if it is of the same cardinality as cy. The set 
R of real numbers (the "continuum" ) is uncountable (cf. exercise If.1.3). 
Now the continuum hypothesis states: Every infinite subset of R is either 
countable or of the same cardinality as R, 
Using canonically defined symbols R, Fin,..., this statement can be 
formulated in L'*' in the following form: 


Vx((x < R a Fin x) > (Count x v x ~ R)). 


This formula is denoted by "CH" (Continuum Hypothesis). The question 
whether the continuum hypothesis holds corresponds to the question whether 
CH is derivable from ZFC. 

K. Gédel showed in 1938: 


3.1. If ZFC is consistent then not ZFC F 1CH, 
and P. Cohen showed in 1963: 


3.2. If ZFC is consistent then nor ZFC & CH. 


Thus if we assume that ZFC is consistent (cf. §4), then neither CH nor 
CH is derivable from it. 

According to previous remarks, ZFC embodies our knowledge of the 
intuitive concept of set which mathematicians in fact use. In the light of the 
results of Gdédel and Cohen we see that our concept is so vague that it does 
not definitely decide the truth or falsehood of the continuum hypothesis. 
One can even show (cf. X.7) that it is not possible to give "explicitly" an 
axiom system ‘Y for set theory, which decides every set-theoretic statement y 
(in the sense that either M+ wor ¥E 7§). 
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In this section we supplement our previous discussion by treating three 
aspects: In 4.1, taking ZFC as an example, we show how the question of the 
consistency of mathematics may be made precise by the use of suitable 
first-order axioms sufficient for mathematics. In 4.2 we discuss misunder- 
standings which may arise from a confusion of object set theory with back- 
ground set theory. Finally, in 4.3 we show how first-order logic, like every 
other mathematical theory, can be based on set theory. 


4.1. In the preceding sections we have emphasized the experience that 
provable mathematical statements can be formalized by {¢}-sentences 
which are derivable from ZFC. Taking this for granted, suppose it were 
possible in mathematics to prove both a statement and its negation. Let g 
be a formalization of this statement. Then both ZFC @g and ZFC + 7A@ 
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would hold and thus ZFC would be inconsistent. Therefore, a proof that 
ZFC is consistent could be regarded as strong evidence for the consistency 
of mathematics. In fact, the question of the consistency of ZFC is one of the 
key problems of foundational investigations. In an explicit formulation it 
asks: Is there a derivation in the sequent calculus of a sequent of the form 
Qo.+» Pn—-1(P aA 1@), where Yo,...,@,—, are ZFC axioms? In this form, 
the problem is obviously of a purely syntactic character. Therefore one might 
hope to solve it by elementary arguments concerning the manipulation of 
symbol strings by sequent rules. (Hilbert also demanded a proof of such an 
elementary nature to recognize "that the generally accepted methods of 
mathematics taken as a whole do not lead to acontradiction"’.) However, by 
Gédel’s Second Incompleteness Theorem, such aconsistency proof for ZFC is 
not possible (cf. X.7). A proof is not even possible if one admits all the auxiliary 
means of the background set theory described by ZFC.* Nevertheless, the 
fact that ZFC has been investigated and used in mathematics for decades 
and no inconsistency has been discovered, attests to the consistency of ZFC. 
In the following considerations we assume ZFC to be consistent. 


42. We investigate the relationship between background set theory and 
object set theory by first discussing Skolem's paradox (cf. §2). In terms of 
ZFC the paradox can be formulated as follows: ZFC, being a countable, 
consistent set of sentences, has a countable model 2 = (A, €“*) according to 
the Lowenheim-Skolem theorem. On the other hand, YI satisfies an {é}- 
sentence @ (derivable from ZFC) which says that there are uncountably 
many sets in A. If for simplicity we again use defined symbols, we can write 


@ = 4x 14y(Function y 4 Injective y a Domain of y = x 
A Range of y <Q). 


@ symbolizes the property of the universe that there exists an uncountable 
set (and hence also that uncountably many sets exist). Since Yl is a model 
of ZFC, we have YW — g, 1.e., there is an a € A (for x) such that 


(*) Wk Tiy(Function y a... a Range of y c w)[a]. 


The set {be Alb €*a} is at most countable because it is a subset of A. 
Therefore in the universe there exists an injective function whose domain is 
{be A|b <a) and whose range is a subset of w. This does not contradict 
(x). For (*) merely says that in YI there is no injective function defined on a 
with values in w“, or more exactly, that there isno b € A such that Function“ b, 
Injective* b, and Domain’ b < 4 «4: in other words, a is uncountable in YW. 

We see from this example that it is necessary to distinguish carefully 
between the set-theoretical concepts (which refer to the universe) and their 
meaning in a model. 


4 Since on the basis of background set theory we proved the correctness of the sequent calculus 
(cf. [V.6.2, 7.5}, the preceding remark says in particular that on this basis one cannot show that 
ZFC is satisfiable, i.c., one cannot prove the existence of a model (A, €*) of ZFC in the universe. 
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Let us consider another example. The set of sentences 
Yi= ZFC vu {c,E€olre R} u {ac =c,|r,s€ Rr Fs) 


is satisfiable, as one can easily show using the compactness theorem. Let 
B = (B, €®) be a model of ¥ (more exactly, the (5)-reduct of a model of Y). 
Then {b € B|b €? a*} is an uncountable set. On the other hand, w® (being 
the set of natural numbers in 23) is Countable? (that is, Countable® w?). 

As before let 8% = (A,¢€*) be a countable model of ZFC. Then 
{a € Ala €* w‘! is countable because it is a subset of A, and we obtain: 


(1) There is no bijection of {b€ B|b €? w*} onto {ae Ala €4 w*), 


since one set is uncountable whereas the other one is countable. At first 
glance (1) seems to contradict Dedekind's theorem, according to which 
every two Peano structures are isomorphic. To analyze the situation, we 
take a formalization y of this theorem as an {¢}-sentence 


y= Vx Vy((Peano structure x a Peano structure y) 
—> x isomorphic to y). 


Then we have 
(2) ZFCE DJ. 


However, (1) and (2) do not contradict each other. (2) merely says that 
in each individual model © of ZFC every two Peano structures are isomorphic 
(in the sense of ©), whereas (1) speaks of Peano structures in different models. 


4.3. We provide a set-theoretic development of first-order logic, l.e., we show 
that its concepts can be based on the concept of set, as we have done already 
for functions and Peano structures. To be specific, we restrict ourselves to 
the symbol set S = {P!, P’,...), where P” is n-ary. Our first goal is to give a 
set-theoretic substitute for S-formulas. 

As a substitute for the variables vg, v;,... we use the elements Q, 1, ... of 
j@. The rbles of the symbols 4, v, 3, = are assumed by the ordered pairs 
T:=(0,0), v =(0, 1, 3:=(0,2) and = :=(Q, 3), respectively. For the 
P" (for n > 1) we take the ordered pairs P* :=(1, x), where x € w — {Q}. 
(Similarly, one could, for example, let ordered pairs (2, x) with x € w stand 
for function symbols. In order to represent symbol sets with uncountably 
many elements, one could use an appropriate set of larger cardinality 
instead of w.) 

Now formulas of the form v, = v,, correspond to triples (x, =, y) with 
x, y € w. These triples are the elements of the set 


At=:=@ x {=} x a. 


Ordered pairs of the form (P*, z), where x € wand zis a function from x into a, 
play the réle of formulas of the form P"z,,,...v (For instance, the 


* Um, — 3° 
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formula P*v,v4v, corresponds to the ordered pair (22, z) with z = {(Q, 1), 
(1, 4), (2, 3)}.) Thus we are led to the set 


At’:= {(P*, z)|x e@and z:x >a}. 


Likewise, one can define the set of all S-formulas set-theoretically to be the 
smallest set A which satisfies the conditions: 


(1) At= u At? CA; 

(2) ifyeA then(a, y)EA; 

(3) ify,z€A then (y, v ,z)eEA; 

(4) ifx ewand yeA then (J, x, y)E A. 


One can now give a natural set-theoretic description of the notions of 
sequent and derivation, developing in this way the whole syntax set- 
theoretically. Semantic concepts such as the notions of structure or of 
consequence can also be introduced set-theoretically. By doing this one can 
obtain a set-theoretic formulation of the completeness theorem. All con- 
siderations can be carried out in L'*' on the basis of ZFC. In particular, the 
completeness theorem can be formalized as an {¢}-sentence and can be 
derived from ZFC. 

What benefits do we obtain from such a set-theoretical treatment? We 
mention three points. 


(1) The mathematical development of first-order logic (as given in the first 
six chapters) can be founded upon the axiomatic basis of ZFC. 

(2) The set-theoretic treatment enables us to deal with uncountable symbol 
sets in a precise manner. Appropriate variations of this approach make 
it possible to define other languages, e.g., languages with infinitely long 
formulas of the form @p Vv @; V @2 Vv ... (Chapter IX). 

(3) In our discussion concerning the formal notion of proof and the scope 
of first-order logic, we did not appeal to the completeness theorem. 
The reason for this was to avoid becoming trapped in a vicious circle 
since the completeness theorem itself requires a proof. In a set-theoretical 
framework one can investigate more closely the assumptions which 
are needed for a proof of the completeness theorem. Doing this one finds 
that a considerably weaker axiom system than ZFC is sufficient for the 
proof (cf. [1]). 


4.4 Exercise. A reader who has been confused by the discussion in this 
chapter says, "Now I'm completely mixed up. How can ZFC be used as a 
basis for first-order logic, while first-order logic was actually needed in order 
to build up ZFC?" Help such a reader out of his dilemma. (Hint :Again be 
careful in distinguishing between the object and the background level.) 


CHAPTER VIII 
Appendix 


In this chapter we note some results which will be needed in Part II and 
which are also of independent interest. 


1 Extensions by Definitions 


We have chosen the symbol set {<) for the theory of partially defined 
orderings and the symbol set {°, e) for group theory. The field of a partially 
defined ordering and the inverse operation in a group are examples of 
relations or functions for which we have no symbol in {<} and {¢,e), 
respectively, although such symbols would facilitate the formulation of 
propositions. In this section we show that the use of additional symbols 
does not increase the expressive power of a language provided they are 
"definable". The field of a partially defined ordering and the inverse opera- 
tion in a group will turn out to be definable in this sense. For motivation we 
first discuss these examples in more detail. 

When one adds to the symbol set {<} a unary relation symbol P for the 
field of a partially defined ordering one has the following definition: 


Wx(Px @ dy(x < yp v y < x)). 


In L'<-*} one can write more transparent formalizations, e.g., the third 
axiom for the theory of partially defined orderings (cf. III.6.4) may now be 
written as follows: 


Vx VVX(Px a Py) > (xX <yvx=yvy <x)). 
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Similarly, the proposition which says that the field of the ordering relation 
contains at least three elements has aformulation in L'<:”} which is both short 
and easy to read: 


dx 34 4z(Px A Py A PZ ATX SYA TX=ZA1 2535). 


Thus the introduction of P is convenient for formalizations. On the other 
hand, it cannot be expected that more propositions about partially defined 
orderings can be formulated in L'<:”? rather than in L'*', since every {<, P}- 
formula can be transformed into an equivalent {<}-formula by replacing 
all subformulas of the form Px by Iy(x < y v y <x). 

Similar considerations apply to definable functions and constants: When 
we add to S,, = {», e} a unary function symbol-' for the group inverse, we 
have the following definition: 


Vx VwW(x) = yoxoy Se). 


Note that the right-hand subformula does indeed define a function in every 
group (regarded as an S,,-structure), since 


®,. Vx a"'yxoy =e. 


In a similar way we can introduce the constant @ for the empty set in 
the set theory ZFC by means of the definition 


VO = xXeVy TV EX); 
for one can show that 
ZFC &@ 4-'x Vy ay ex. 
In the sequel g(vo,..., U,—-1;) stands for a formula @ where the variables 


occurring free are among Uo,..., U,—1- 


1.1 Definition. Let ® be a set of S-sentences. 


(a) Let P ¢ S be an n-ary relation symbol and @(vg,..., v,—1) an S-formula. 
Then we say that 


Vege cs V0 yo (P09 «on Ugg > Oo, «iss Uni) 


is an S-definition of P in ®. 
(b) Let f ¢S be an n-ary function symbol and @(vp,..., U,) an S-formula. 
We say that 


Vig .-- VO, Wa fo --- Un—-1 = ta P(Uo,..-, U-1, Un) 
is an S-definition of f in D provided that 
D & Vug... V0__ 1 d=! tp P(V9, ++ -5 Up): 
(c) Let c¢S be aconstant and ¢(v,) an S-formula. We say that 
Wvo(c = 09 © P(vo)) 
is an S-definition of c in D provided that ® — 37 "vp @(v9). 
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We thus say that Vug(Pv9 © Jvy(vp < vy V Vy < U9) is a {<)-definition 
of P in ®,,.4, Wo Voi(v9 1 = vy v9 ov, =e) is an S,,-definition of ~* in 
®,,, and Voo(O = v9 Vv, 1X, € vo) is an {€}-definition of @ in ZFC. 

Now assume that a symbol set S and a set ® of S-sentences are given. 
Suppose that S“ > S, and that for every symbol in S‘ — S exactly one 
S-definition in ® has been fixed, say, 


for n-ary Pe S* — S the S-definition 
(i) Vvg ... WU_—1(PU9 ... Vv. -1 A Pp(Uo,..., V. -1))s 
for n-ary f € S* — S the S-definition 
(li) Veg... WO,—1 VO S09 «++ Pn—1 SV, A OVO, «ey Un—19 Un))s 
and for ce S* — S the S-definition 
(ili) Voo(c = v9 4 @,(Uo)). 


Let A be the set of definitions in (i), (ii), and (iii). As in the case of the group 
inverse, the interpretation of the symbols in S$“ — S is determined in every 
model of D: 


1.2 Lemma. /f the S-structure W is a model of ®, then there is exactly one 
S*-structure 2“ such that 


(*) Wf S= A and WE A. 

PROOF. Suppose W4fS = Wand WE A 

Then, by (i), 

(1) PH ay... Any iff Wk @plag,.--.4n—14 
for n-ary Pe S* — S anda,, ...,4,-1; € A; 

by (ii), 

(2) f™ (dos-+-5Qn-1) =a, iff WE PrlAg,-+++a,-,,a,] 


for n-ary f € S* — Sanda,, ...,a,-1,a € 4M; 
and by (iii), 

(3) ca iff Ue ofa] 
force S‘ — SandaeéA. 


(1), (2), and (3) show that there is at most one S*-structure Q4 satisfying (+). 
On the other hand, (1), (2), and (3) determine an S*-expansion %&* of W with 
the property 24 A (Indeed, (2) and (3) determine an n-ary function and an 
element of A since by 1.1(b), (c) we have 

UW os Vv vs NOs 1 a Un@ (Uo, -++5 Un—-1, Un) 


and Qf & 47'v_ @~,(vo), respectively.) LJ 
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It is intuitively obvious that the introduction of defined symbols does 
not increase the expressive power: For every formula y containing defined 
symbols, one obtains an equivalent formula WY without such symbols by 
replacing the new symbols by their defining formulas. This is the content of 


1.3 Theoremon Definitions. Let S* > S and ® < L3. Let A be a set of 
S-dejinitions in ©, one for each symbol in S* — S. Then for each  € L3* 
there is a formula w’ € L* such that 


(a) Given an S-structure Y with We @Manda,, ...,a,-,€A, 
WW = Wlaog,..-, An-1] iff We W'[ao,..., 4,1]. 


(214 denotes the S4-expansion of W, which, by 1.2, is uniquely determined 
by the condition 24 — A.) 
(b) DUAEF Woy. 


Proor. As already mentioned, we obtain w’ from w by replacing all symbols 
from S* — S in & by their defining formulas. In this process we must take 
into account the possibility that a term may contain nested function symbols. 
Thus, for instance, the formula 


Ww = Ixfgx = y 


(where f, g € S* — S) will be converted into one of the form 


XU uv 
dx du (oe A nn A v= ») 


We define ¥: LS* = L* by induction on S*-formulas. The definition for 
atomic wW uses induction on m(wW), where m(y/) is the number of occurrences of 
symbols from S* in y, including repetitions. 

If my) = 0 then y* = w. 

If m(y) > O let Xo, x,, X2,... be the enumeration of the variables not 
occurring in ¥ in the order of the enumeration v9, ¥4,.... 


Case 1. W = Pto...ty—1! 


WY = IXg 2. IX 1 ((X0 = to)” Av A nan = tn)” 
A PX6icvX_-1), UPES, 


YW" = 3 Dynal (¥o — fe)” Ratoa es We (X,—14 — oe i 
Gp Oe), ope Sears 
Dy citctaye. 1 eect sO 


(Since P does not occur in x; = t;, m(x; = t) < m(W)) 
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Case 2. is of the form t’ = t". Since m(yv)} > 0, W is not of the form x = y. 
QW afte. hea St 


PY = AX... IX, Ax,((Xo = iy Rant, yg Sitges) 
A(X, Bt A fxXo---Xn-y =X, USES, 


WY = Ax, ... Ix,-1 a (cc = to)’ Avs A (Xq-1 = tn-1)" 


Xg...X, : 
A(X,= 0" A 9 2%), iff eS*-S. 


(2) p=c=t: 


WY = Axo((Xp = 1’ Ac=Xo), ifceS, 


pr = vo ty a o, = ifceS*—S. 


Vo 


(3) y= x = fty...t,-, or W = X =c: analogous to (1) or (2). 


This completes the definition for the atomic case. 
For the induction step we set 


(ayy = ho v Wi) = (5 v YY), and (Ax) = Ixy’. 


Now, upon using this definition, it is not difficult to prove by induction 
that statement (a) of the theorem holds. Furthermore, it is clear that wy’ ¢ LS 
provided w € L°. It is also clear that for 2} Mand a,, ...,4,-,EA 


(4) We Wow')[ao,...,4n-1 1. 

For (b): Let 8 be an S*-structure such that 8 — Mu A, and let 
bo,-++»b,-5€B. We must show that BE (W amW")[bo,..., 5,-,]. For the 
structure WI :-= B [ S, we have U4 = B by 1.2. Thus (b) followsfrom(+). UO 


Given elementarily equivalent S-structures 9 and B, we have for  € L$": 


Wey iff Wey’ 


iff Be yw’ 
iff BAe w 
Hence we have 
1.4 Corollary. [f U = B then 4 = B+. = 


When treating set theory in Chapter VII, we have frequently used defined 
symbols to obtain more appealing formalizations. Now we have justified 
this procedure since, by 1.3(b), a formula W in the extended language can be 
regarded as an abbreviation for the {¢}-formula ’. 
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We point out a further application: An ordering is sometimes considered 
as a structure (A, <), sometimes as a structure (A, I). The theorem on 
definitions implies that the choice of the basic symbols is immaterial, pro- 
vided the symbols are definable from each other. In the case of orderings we 
obtain that L'<) and L's! have the same expressive power. For a precise 
formulation see exercise 1.11. 


A symbol set is called relational if it contains only relation symbols. 
Sometimes it is convenient to be able to restrict oneself to relational symbol 
sets. Using the theorem on definitions, we show how function symbols and 
constants can be replaced by relation symbols in order to obtain a relational 
symbol set. The idea is to consider the graph of a function rather than the 
function itself. 

Let S be a symbol set. For every n-ary f € S let F be a new (n + 1)-ary 
relation symbol, and for c € S let C be a new unary relation symbol. Let S* 
consist of the relation symbols from S together with the new relation symbols. 
Thus S’ is relational. We associate with every S-structure Wan S*-structure 
YI" by replacing the functions and constants by their graphs: 


(1) Av:=A; 
(2) for Pe S, P™ :-= P"; 
(3) for n-ary f €S let F™ be the graph of {™, that is, 
FE prey ee iff Fendi, Oe) = 4a,; 
(4) for c €S let C™’ be the graph of c™, that is, 
Cue al ¢> = 

Correspondingly, one can transform every S-formula into an S*-formula, 
replacing atomic subformulas such asfxy = z by Fxyz and c = x by Cx; 
nested function symbols are treated as in the proof of 1.3. For instance, if 
y = fcgx =c we set Ww" = dy dz(Cy a Gxz a Fyzy). Then for arbitrary y 
we have that y holds in Q if and only if yw" holds in YW’. A precise statement of 
this relationship is contained in the following theorem; the proof follows 


the outline above and is included for the reader who is interested in seeing 
the details. 


1.5 Theorem. For every y € L3 there is yy" € LS” such that for all S-structures 
Wand ay,...,4,-1€ A: 


(+) We Wlao,..., 4n-1] iff W t= W'Ldo,-.-, An—1). 


PROOF. Instead of proving the theorem directly, we refer to 1.3, now letting 
S u S‘ play the réle of S* and S" the réle of S. Further let 
D -—{Vv9... Vo,-1 47 'v, Fug... 0,—10,| f €S and f is n-ary} 
VU {47 ‘D9 Cvg|c € Shs 
A:= {V0 ... V0, 1 Via fo ..+ Un = U_ CO Fog .-- Un— 1p) 
f €Sandf is n-ary} u {Voo(c = 09 Cro) ic € S). 
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For every function symbol f and every constant c in S, A contains an S’- 
definition in ® which says that f and c have graphs F and C, respectively. By 
1.3, for every formula wy € LS’"" we have WY € L®”, and for every S'-structure 
%8 which satisfies P and for all bp,...,b,-, € B, 


(*) BE Wbo,...,b,-1) iff BE W'[bo,..., b,- 1] 


For W € L* set P=. 

Now, in order to prove (+) let Yf be an S-structure. We apply (*) for the 
case B:= YW" (W" satisfies D!). Since W'* — A, the transition from W" to 
91“ means that one adds exactly the functions and distinguished elements 
which were eliminated in passing from YW to YW". Hence W* | S = YW. Thus 
for yy € LS and ao,...,a,-1 € A, 

We Whdg,-.-.4n—4] iff i Wldag,-..,4n-14 
iff QW — Wldag,.--54,-1] (coincidence lemma) 
iff Wk W'Lao,...,a,-1] (by (+), 


and since wY = w", the theorem is proved. C) 


Given S-structures 2 and 8 such that 2” = 8B", we have by 1.4 that 
ars = Br As WATS = Wand BTS = B it follows that W = B. Hence 
we have obtained 


1.6 Corollary. For any Yf and 8, YW’ = B" implies W= B. 


The converse of 1.6 can be shown similarly. Define 


A = {Wo ene Vv, —1 VOC F v9 es Un—1Un  fvo +12 U,-1 = val f ES is n-ary } 
U Wup(Cv9 oc = v9)|CES). 


Then for every new relation symbol in S” — S, A contains an S-definition in 
the empty set of sentences. Clearly for every S-structure Mf, UW“ S" = W”. 
Now, if % = %, then 1.4 yields 2(4 = 4, and hence by the coincidence 
lemma we have YI" = 8B", Together with 1.6 we obtain 


1.7 Theorem. For any Y and 8, W = B if and only if W = B". LI 


We shall use 1.7 in some discussions on elementary equivalence when it is 
convenient to restrict to relational symbol sets. 


1.8 Exercise. In the notation of 1.3, show that® U At w iffO & yw’. 


1.9 Exercise. Let us call a formula y term reduced if its atomic subformulas 
are of the form PX9...X,-1,X% = ¥, fXo..-X_-1 =X,, Or c = x. Show that 
every formula is logically equivalent to a term reduced formula with the 
same free variables. 
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1.10 Exercise. Prove theorem 1.5 in the following way: Using 1.9 note that 
one just needs to consider term reduced formulas. Then give an inductive 
definition of yw" for term reduced formulas. 


1.11 Exercise. Show that for any g¢€ L{<! there is a w € Lb*! and for any 
yw € Li! there is ap € Lb<! such that 


(a) an ordering (A, <%) satisfies @ if and only if the corresponding ordering 
(A, <*) satisfies I), 

(b) an ordering (A, <%) satisfies y if and only if the corresponding ordering 
(A, <4“) satisfies @. 


§2. Relativization and Substructures 


If one regards a vector space as a one-sorted structure, then the domain 
consists of scalars and vectors (cf. III.7.2(2)). When formulating the vector 
space axioms in the corresponding language, one must relativize the field 
axioms to the set of scalars and the group axioms (for the vectors) to the set 
of vectors. For the field axiom Vx x-1 = x this can be done by using the 
relation symbol F for the set of scalars and reformulating the axiom as 
Vx(Fx > x.1 = x). Similarly, the S,,-formula 


g:=Vx(x =O0v x= 1), 
when relativized to Ff, becomes 
of := Vx(Fx > (x = 0 v x = 1)). 


In a vector space, g* just says that the field of scalars satisfies @. This section 
deals with the relation between a formula and its relativization. First it is 
useful to introduce the notion of substructure. 


2.1 Definition. Let 29 and & be S-structures. Then Yf is called a substructure 
of 8 (written: Yc 23) if 


(a) ACB; 
(b) (1) for n-ary PeS, P™= Pn A® (that is, for all ao,...,@,-1€A, 
Pe Gig od hy cg IEE Pp tind dy og) 
(2) for n-ary f € S,f™ is the restriction off" to A’; 
G)iorcescc a. 


For example, 9t = (N, +%,-%,0, 1) is a substructure of the field R = 
(R, +®, -®,0, 1) of real numbers. 

If YW < %, then A is S-closed (in 8B), that is, A # @, c"'e€ A force S, and 
dy,+.+,4,—-,€A implies f(ag,..., Gn) € A for fES. 
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Conversely, every S-closed subset X of B is the domain of exactly one 
substructure of 23, because in this case the conditions in 2.1(b) determine 
exactly one structure with domain X. We denote this substructure by [X]”. 

For example, the set {ré R|r > 0} is S,,-closed in SR and hence is the 
domain of a substructure of R, but the set {re R|r < 0} is not S,,-closed, 
since (—1)-(—1) isnot <9, 


2.2 Lemma. Let % and 23 be S-structures such that 2 < 23, and let 
B: {v,|n EN} > A he an assignment. Then, for every S-term t and for every 
atomic S-formula , 


(UW AO) =(B BPO, WADE HF (BH\eE oe. 
The proof by induction on terms is straightforward. 


The result 2.2 does not hold for arbitrary @: the formula 4x x + 1 =0 
holds in R but not in its substructure Yt. 


2.3 Definition. Let S be a symbol set and let P be a unary relation symbol. 
By induction we define, for every y € L‘, the formulaw? € LS“'"! the so-called 
P-relativization of y: 


pPi=w, ify is atomic; 
pay := ay; 
(Wo Vv Wi)? = (Wo Vv Wh): 
[Axw]? = dx(Px a w°). 


Thus [Vxw]? = [73x nw]? = 74x(Px a mW), and this formula is 
logically equivalent to ¥x(Px > W”). Moreover, it is clear that free($) = 


free(y’). 


2.4 Relativization Lemma. Let 21 be an S u {PlI-structure such that P* < A 
is an S-closed set. Then for w ¢ L§, 


[PAP ey if Ue yp 


2.4 describes the relationship sketched at the beginning of this section: 
the relativization w’ says the same in YW as W does in the substructure [P 4}. 


PRoor. We show by induction on w € L*: 


(*) For all assignments f: {v,|ne N} > P4, 


(CP Bey iff QL pe py’. 
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If w is atomic, then w? = w, and we obtain (*) from 2.2. For Y = Wo o,r 
wv = (ho Vv W1), (*) follows directly from the induction hypothesis for Wo 
and wW,. In case w = 4xWo we argue as follows: 


a 
([P’}", B) k= Ixwe iff for some ae P*, (ay B “| FE Wo 
iff for some ae P’*, (2 Bp :| E ws (by induction 
ss hypothesis for Wg) 


iff for some ae A, (x. b KE Px a we 
iff (QU, B) = Ax(Px a wh). a 


2.5 Exercise. Let U and V be distinct unary relation symbols, U, V¢ S. 
Assume (I, U“, V“) to be an S u {U, V}-structure such that U* and V* are 
S-closed and U“ < V4. Show that for g € L3 


(Wt, U4, V4) & (eT 9"). 


2.6 Exercise. A formula of the form 4x9... 4x,_;@ (VXo...V¥X,— 4,9), where 
n > OQand @does not contain any quantifiers, is called an existential formula 
(universal formula). Show: 


(a) If Qc B, @ is an existential sentence and YW = 9, then 23 — g. 

(b) If 2U < B, g is a universal sentence and B — g, then WE ¢. 

(c) In the language L** there is no system of axioms for group theory 
consisting only of universal sentences. 


§3. Normal Forms 


In this section we show that one can associate with every formula a logically 
equivalent formula which has a special syntactic form. 

Let S be a fixed symbol set. For an arbitrary set ® of S-formulas let <> 
be the smallest subset of L* containing ®, and containing with any w and x 
the formulas ~y and (W v x). Note that ® < L3 implies <®) < L*. 


3.1 Lemma. Let © < L>. Suppose Y and 23 are S-structures, and do,..., 
a,_, € A, bp,..., b,-, € B. If 

) Wie Pldg,...,%-1) iff BE glby.....b,-1] 

holds for all @ € ®, then (*) holds for all 9 € <P). 


ProorF. The set of g for which (*) holds includes ® and with any w and x also 
contains Tw and (W v 7). 
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3.2 Lemma. Let D = {@,..-, @,} he a finite set of formulas. Then every 
satisfiable formula in <®) is logically equivalent to a formula of the form 


(+) (Woo Att A Won) Viet V Wo A A Wands 


where k < 2"*! and for i<k and j <n, W;,; = 9; or Wi; = 19;. In par- 
ticular, there are only finitely many pairwise logically nonequivalent formulas 
in (QO). 


Thus we see that every formula in (CO) is logically equivalent to a dis- 
junction of conjunctions of formulas from {@o,.--3 Ons Tos -++5 1Qns- 


Proor. We choose an r such that ® = {@o,..., ,} © L®. For a structure 


Q and an r-tuple d :=(a,, ...,a,-,)€A’ let 
(1) Wea. t) = Wo Ao A Wns 
where 
ae he if We glao,...,4,-1], 
a2, it QU 1p fags ey 
Then 
(2) We Wa.alaos.+-+4—i, 


and W,g 1s a conjunction of the form in (+). Moreover, for any 23 and 
bo,...,5,-1 € 23, 


(3) Bt Wa jlbo....,%-1] 
iff fori = 0,...,n: 
Wt gfao,...,4,1] iff BE olbo,..., 5-1] 
iff (cf. 3.1) for all pe (0): 
WE Ogi nce Wath 23 ol bg Pei 
From (1) it follows that the set 
{Wi Z| Yl is an S-structure and d€ A’) 


has at most 2”*' elements. 

The proof is complete if we can show that every satisfiable @€ CO) is 
logically equivalent to the disjunction 7 of the finitely many formulas from 
the set 


{Wars |W is an S-structure, de A’, Wh efao,..., a,-1]}. 
In a suggestive notation we write 


= \/ {Wea ay | is an S-structure, d € A", WE @ldo,..., a,—1]}. 
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To verify the equivalence between @ and y, assume first that BE 
o[bo,..., 5,1]. Then Wy f) is a member of the disjunction x. Since 23 f= 
Ws,bLbo....,b, 41 (cf. (2) jit follows that B & xLbo..--, ,—1 ]. Conversely, 
if Be xLbo,...,5,-,], then by definition of y¥ there is a structure YI and 
there are a,, ...,4,—, € A such that 


YW f= olag,...,4,-1] and BE War, albo....,5,-11- 


Then by (3), bo,...,b,-, satisfy the same formulas of ¢<®> in 23 as 
dy,+-.,4-1 do in UW. In particular, Bt olbo,..., b,-4 J. ea 


A formula which is a disjunction of conjunctions of atomic or negated 
atomic formulas is called a formula in disjunctive normal form. A formula 
which contains no quantifiers is said to be quantifier-free. As a corollary to 
3.2 we obtain 


3.3 Theorem on the Disjunctive Normal Form. /f' @ is guantifier-free, then @ 
is logically equivalent to a formula in disjunctive normal form. 


PROOF. Let ~ be a quantifier-free formula. If @ is not satisfiable then @ is 
logically equivalent to Tvp9 = vo. If @ is satisfiable and Wo,..., yw, are the 
atomic subformulas in @, then pg € ({Wo,.-., Wat >. The theorem now follows 
from 3.2. CI 


We turn to formulas which also contain quantifiers. A formula y is said 
to be in prenex normal form if it has the form QpXp9..- Qm—1Xm—1Wo, Where 
Q; = 3 or Q; = V for i < mand Wo is quantifier-free. Og X9.-.Qm—1Xm—1 1S 
called the prefix and Wo the matrix of w. 


3.4 Theorem on the Prenex Normal Form. Every formula @ is logically 
equivalent to a formula if in prenex normal form with free(@) = free($). 


PROOF, First we note some simple properties of logical equivalence. By 
“@ ~ w” we mean that ¢ and w are logically equivalent. 


(1) Ifge~ y, then Ne ~ Ty. 

(2) If@o ~ go and g, ~ Wy, then (@o V G1) ~ (30 V Wi). 

(3) Ifoe~ wand QO = 30rQ =V, then Oxg ~ Oxy. 

(4) NAx@ ~ Vx 4@, 7Vxcp ~ 4x Tg. 

(5) If x € free($), then (Axo v W) ~ Ax(g v Ww), (Vxcp v W) ~ Yx(@ v §$), 
(Gy v dxg) ~ IxQ v @), and ( v Vxcp) ~ Vx v @). 


We shall see how one can transform a given formula into prenex normal form 
by repeated application of (/)-(5).For instance, if @ = T4xPx v VxRx we 
can proceed as follows: 


T4xPx v VxRx ~ Vx TPx v VxRx_ (by (2)and (4)) 
~ Vx Px v VyRy (since VxRx ~ VyRy and by (2)) 
~ Vx(4 Px v VyRy) (by (5)) 
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In general we argue as follows: For g € L* let qn(@) be the quantifier number 
of g, i.e,, the number of quantifiers occurring in m. Using induction on n, we 
prove: 


(x), For @ with qn(~) < n there is a I) € L‘ in prenex normal form 
such that g ~ J), free(w) = free($), and qn(g) = qn). 


We leave the arguments for “free(@) = free($)" to the reader. 
n = 0: If qn(g) = 0, @ is quantifier-free and we can set ]) := @. 


n > O: We show (*), by induction on @. Suppose qn(@) < n. The quantifier- 
free case is clear. If ¢ = 19’ and qn(@) > O, then qn(g’) = qn(@) > O, and 
by induction hypothesis there is a formula of the form Qxy which is a prenex 
normal form for @’ (where gn(Qxy) = qn(@) and where x may contain 
quantifiers). By (1) and (4), @~ Q~'x- (where V>! = J and J7'+Y). 
Since qn(7 74) = qn(Qxy) — 1 = qn(@~) — | <n — 1, there exists a formula 
W logically equivalent to —y which is in prenex normal form such that 
qn(yv) = qn(7). By (3), 7} xy is a formula logically equivalent to @ with 
the desired properties. 


Let p =(@’ v o’) and let qn(~) > 0, e.g., qn(@’) > O. By induction 
hypothesis there is a formula of the form Qx7 which is a prenex normal form 
for g’. Let y be a variable which does not occur in Qxy or in @”. It is easy to 
show that 


y 
Oxy ~ Qyy — 
x 


and thus, by (2) and (5), to obtain 


; ta y ty 
p=(@9 vo ~ (axe v 0 


y ef 
orf; V@ ) 


Since qn * v ~") =qn(~) - 1 <n — 1, we can find a formula J) in 


prenex normal form which is logically equivalent to (y2 v oy”). Ovw has 
x 


the desired properties. 

Let g = Ixg’. Since qn(g’) <n — 1 there is a formula J)' in prenex 
normal form which is logically equivalent to g’. dxy’ is a formula in prenex 
normal form which, by (3), is logically equivalent to @ and has the same 
quantifier number as g. Cl 


3.5 Exercise (Conjunctive Normal Form). Show that if @ is quantifier-free, 
then @ is logically equivalent to a formula which is a conjunction of dis- 
junctions of atomic and negated atomic formulas. 
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3.6 Exercise. Let § be a relational symbol set and let ~ € L§ be of the form 
IXg..- dX, V¥o.-- Vem, where y is quantifier free. Show that every model 
of g~ contains an (n + 1)-element substructure which is also a model of ¢. 
Conclude that the sentence Vx Jy Rxy is not equivalent to any {R}-sentence 
of the above form. 


PART B 


CHAPTER IX 
Extensions of First-Order Logic 


We have seen that the structure Jt of natural numbers cannot be charac- 
terized in the first-order language corresponding to 9¥t. The same situation 
holds for the class of torsion groups. As we showed in Chapter VII, one can, 
at least in principle, overcome this weakness by a set-theoretical formulation: 
One introduces a system of axioms for set theory in a first-order language, 
e.g., ZFC, which is sufficient for mathematics, and then in this system carries 
out the arguments which are required, say, for a definition and characteriza- 
tion of 9t. However, this approach necessitates explicit use of set theory to 
an extent not usual in ordinary mathematical practice. 

The situation may act as a motivation to consider languages with more 
expressive power which permit us to avoid this detour through set theory. 
For example, in a second-order language we can directly characterize the 
natural numbers by means of Peano’s axioms. However, already at this 
stage we wish to remark that in order to set up the semantics of such a 
language and to prove the correctness of inference rules, one has to make 
more extensive use of set-theoretic assumptions (for example, of the ZFC 
axioms) than for first-order logic. 

There is a further reason for introducing and investigating more powerful 
languages. We saw that results such as the compactness theorem are useful 
in algebraic investigations (cf. VI.4). Therefore it seems worthwhile to seek 
other more expressive languages in the hope of obtaining tools for more 
far-reaching applications in mathematics. 

In this chapter we introduce the reader to some of the languages which 
have been considered with these aims in mind. 


1X. Extensions of First-Order Logic 


§1 Second-Order Logic 


The difference between second-order and first-order languages lies in the 
fact that in the former one can quantify over second-order objects (for 
example, subsets of the domain of a structure) whereas in the latter this is 
not possible. 


1.1 The Second-Order Language L3,. Let S be a symbol set, that is, a set of 
relation symbols, function symbols and constants. The alphabet of Li 
contains, in addition to the symbols of L*, for each n > 1 countably many 
n-ary relation variables VG, Vi, V5,....To denote relation variables we use 
letters X, Y,.... We define the set Be of second-order S-formulas to be the 
set generated by the rules of the calculus for first-order formulas (cf. 1.3.2), 
extended by the following two rules: 


(a) If X is an n-ary relation variable and fo,...,f,— 5 are S-terms, then 
Xtg...¢,—, is an S-formula. 
(b) If@is an S-formula and X isa relation variable, then 3X gis an S-formula. 


1.2 The Satisfaction Relation for L>. A second-order assignment } in a struc- 
ture 2 is a map which assigns to each variable v; an element of A and to each 
relation variable V" an n-ary relation on A. We extend the notion of satis- 
faction from L* to L> by taking (a) and (b) into account as follows: If QW is 
an S-structure, y a second-order assignment in 2, and 3 = (QI, y), then we 
set 


(’) SBE Xty...t,—, iff y-X) holds for S(t), ..., 3(t,— ,). 
C C 
(b') 3. 4X@ iff there is a Cc A® such that 3 x w) (where 3% = 


C : ; ; 
(x. » S} and + 5 is the assignment which maps X to C but which otherwise 


agrees with y }. 


We write # to denote second-order logic, that is, the logical system given 
by the languages L* together with the satisfaction relation for these languages. 
Similarly, we denote first-order logic by #,. For the present we still use the 
term "logical system" in the naive sense. A precise definition will be given 
in XID. 


1.3 Remarks and Examples 


(1) One defines the free occurrence of variables and relation variables in 
second-order formulas in the obvious way and can then prove the analogue 
of the coincidence lemma. In particular, when @ is an L}-sentence, i.e., a 
formula without free variables or free relation variables, it is meaningful to 
say that Wis a model of @, written WE g, 
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(2) Let VX@ be an abbreviation for 74.X 71¢. Then 
o 
JEVXo iffforall Cc AN STE Q. 


(3) If X is a unary relation variable, then the following formalizations of 
Peano’s axioms, 


(P1) Vx nox =0; 
(P2) Vx Vy(gx = gy > x = y); 
(P3) VX(X0 4 Vx(Xx > Xax)) = VpXy), 


which we had in III.7.3, are L{t-°-sentences. Hence by passing from first- 


order to second-order logic we have gained expressive power since no first- 
order axioms can characterize the structure (N, a, 0) up to isomorphism. 


(4) Let S be arbitrary. Then the Lj-sentence 


GS Vx V(x = yOVX(XxOXy)) 


is valid: two things are equal precisely when there is no property which 
distinguishes them (the identitas indiscernibilium of Leibniz). Thus in the 
development of L} we could have done without the equality symbol using 
(+) to express equality. 


(5) When setting up the second-order languages we could have introduced, 
in addition to relation variables, function variables which can also be quan- 
tified. This procedure would increase convenience, but not the expressive 
power of the languages. We illustrate this by means of an example (cf. the 
elimination of function symbols in VIII.1). 


Let g be a unary function variable and let @ be the "second-order formula" 
Yg(Vx Vy(gx = gy 7x = y) > Wx dyx = gy). 


Then (for the natural extension of the notion of satisfaction) the following 
holds: 


WE qg iff every injective function from A to A is surjective 
iff A is finite. 
Considering the graph of a unary function instead of the function itself, 
we can use a binary relation variable and rewrite @ as 
VX((Vx 3° 'y Xxy a Vx Vy V2((Xxz a Xyz) > x = y)) > Vx dy Xyx). 
Call this formula @,,,,. @ and @,;, have the same models. Therefore, 
Wi= g,,, iff A is finite. 


In later examples we shall often use function variables to obtain formulas 
which are easier to read. 
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(6) In #, one can introduce operations such as substitution and relativiza- 
tion (cf. VIII.2.3) by definitions analogous to those for O, -One can also 
verify basic semantic properties such as the analogue of the isomorphism 
lemma. 


The situation is different when we consider deeper semantic properties 
such as the completeness theorem, the compactness theorem and _ the 
Léwenheim-Skolem theorem: the price we have to pay for being able to 
quantify over second-order objects is the loss of all these central properties. 
1.4 Theorem. The compactness theorem does not holdfor #1. 


Proor . The following set of sentences is a counterexample: 


{Print U {Poni = 2}. 


This set is not satisfiable, but, of course, every finite subset is satisfiable. [] 


1.5 Theorem. The Léwenheim—Skolem theorem does not hold for #,. 


Proor. We give a sentence @,,,, € L such that for all structures W, 


WE g,,. iff A is uncountable. 


Then @ync 1S Satisfiable but it has no model that is at most countable. 
To define @,,- we use an L?-formula @;,,(X) with just one free unary 
relation variable X, for which 


(My) K Og X) iff 7X) is finite. 


It is easy to obtain such a formula by modifying @,;, as given above. Clearly 
a set A is at most countable if and only if there is an ordering relation on A 
such that every element has only finitely many predecessors. So let us define, 
using a binary relation variable Y, 


Pea = 3Y(VX 7 Yxx a Vx Vy V2((Y¥xy a Yyz)— Yxz) 
A Wx Vy(Yxy v xX = y v Yyx) 
AVX AX(@rin(X) A V(X yO Yyx))). 
Then we have 


WE Peon iff A is at most countable 


and hence we can set @ync'= TP <ctbi 


1.6. For first-order logic we obtained the compactness theorem from the 
existence of an adequate system of derivation rules (cf. VI.2). For 4, there 
is no correct and complete system of derivation rules. Otherwise we could use 
the same argument as for #; to prove the compactness theorem for #y,. 
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This negative result does not, of course, hinder us from setting up correct 
rules for second-order logic. For example, one can add to the first-order 
rules the following correct rules for quantification over relation variables: 


Tre . T oy 


rixe TiXo W’ if X is not free in Tw. 


In the introduction to this chapter we provided two motivations for 
investigating more expressive languages, namely: (a) to facilitate the forma- 
lization of mathematical statements and arguments, and (b) to supply us 
with more powerful tools for mathematical investigations. In regard to (a) 
and (b), what have we accomplished by second-order logic? 

To begin with, we note that by supplementing the second-order rules 
presented above, one can obtain a system largely sufficient for the purposes 
of mathematics. (However, by 1.6, one never gets a complete system, so that 
the choice of rules can only be made from a pragmatic point of view, and not 
with the aim of attaining completeness.) In addition, bearing in mind that 
mathematics can be formulated more conveniently in a second-order 
language, one can tend to the opinion that progress in the sense of (a) has 
indeed been made. However, as far as (b) is concerned #y is hardly an 
appropriate system. The results 1.4 and 1.5 already hint at this. The expressive 
power of second-order languages is so great that results such as the compact- 
ness theorem or the Lowenheim-Skolem theorem, which are of value for 
mathematical applications, no longer hold. In view of these remarks it is 
natural to investigate other extensions of first-order logic (cf. §2, §3). 

By considering a further aspect we explain how, in a certain sense, second- 
order logic has overshot the mark: We show that set theory, as based, e.g., 
on ZFC, is not sufficient to decide basic semantic questions for “,. This 
follows because we can write down a sentence qy € Li? which is valid if 
and only if the continuum hypothesis CH holds. Since neither CH nor its 
negation can be proved in ZFC (cf. VII.3), the validity of meq can neither be 
established nor refuted within the framework of ZFC. 

CH says: 


(1) For every subset A of R, either A is at most countable, or there is a 
bijection of R onto A. 


cy Will be essentially a formalization of (1). First, similar to @ za), we can 
easily give a formula @ ~ .4,(X) with the property 


(QW, y) = Pecyi(X) iff p(X) is at most countable. 
Further, there is a formula @p such that 
(2) AF @piff A and R have the same cardinality. 


To obtain @g, note that the ordered field R* of real numbers is, up to iso- 
morphism, the only complete ordered field. Therefore, if y is the conjunction 
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of the axioms for ordered fields and of the second-order S,;-sentence 


VX((AxXx a dy V2Xz 372 < y)) ~ Av(W2(Xz — (7< yp v z= y)) 
A Yx(x < yp d2(x < za Xz)))) 

("every nonempty set which is bounded above possesses a supremum"), 
then for all S;,-structures QI, the following holds: 
(3) We wiffA = R*. 
Hence, in order to obtain (2), we can choose as @g an Li?-sentence which 
Says: 

"There are functions +, -, elements 0, 1, and arelation < such 

that y”. 


(We leave it to the reader to write down @g as a second-order sentence.) 
Now we can take as @cy a sentence which says that "if the domain is of the 
same cardinality as I, then every subset of the domain is either at most 
countable or else of the same cardinality as the whole domain", 
cu = Pr 7 VX(@cerw(X) V Ag(VxX gx 
A Vx VyWgx = gy 7x =y) a Vy(Xy > Axgx _ y))). 

It is easy to prove (cf. (1)) that 

FQcy iff CH holds. LO 


1.7 Exercise (The System #y of Weak Second-Order Logic). For every 8, 
let L#5 = L}. Change the notion of satisfaction for & by specifying, for 
3 = (Y, y) and n-ary X, that 


3 ,,4X o_ iff there is a finite C < A" such that 3 < Fw Q- 


Thus only quantification over finite sets (and relations) is allowed. 
Show: 


(a) There is a sentence g and a structure 2 such that WeE,, g but not WE ¢g. 

(b) For each sentence @ € Ly*, there is a sentence w € L} such that for all 
S-structures W, We, @ iff We w. 

(c) The compactness theorem does not hold for #7. 


(However, the Lowenheim-Skolem theorem does hold for #7; cf, exercise 
2.7 in the next section.) 


§2. The System 2,4 


In VI.3.5 we showed that the class of torsion groups cannot be characterized 
in first-order logic. But we can axiomatize this class if we add to the group 
axioms the "formula" 


(*) Vx(x =eOvxoxB=e@vxeoxox=ev---), 
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Thus we gain expressive power when allowing infinite disjunctions and 
conjunctions. Such formations are characteristic of the so-called infinitary 
languages. In the simplest case one restricts to conjunctions and disjunctions 
of countable length. This leads to the system #,,,,,. (The notation #4 ,. 
follows the systematic terminology usual in the study of infinitary languages, 
cf. [7]). 

To define the formulas of ”,,,.. we use the jargon of calculi. Nevertheless 
it should be noted that the rule in 2.1(b) below is not a calculus rule in the 
strict sense, since it has infinitely many premises. (For example, in order to 
obtain the formula (*) one must already have obtained the formulas x = e, 
Xo xX =e,....) A precise version of such "calculi" and their usage can be 
given within the framework of set theory (cf. VII.4.3). For example, the 
definition of formulas and proofs by induction on formulas can be based on 
the principle of transfinite induction. 


2.1 Definition of /,, ,,. Compared with the first-order language L*, we 
add the following to constitute the language is 

(a) the symbol VV (for infinite disjunctions); 

(b) to the calculus of formulas the following "rule": 


If ® is an at most countable set of S-formulas, then \/® is an 
S-formula (the disjunction of the formulas in 9); 


(c) to the definition of the notion of satisfaction the following clause: 


If ® is an at most countable set of L3 .,-formulas, W an S- 


structure, f an assignment in W and 3 = (YI, f) then 
3 \/® iff 3 @ for some ge ®. 


There are many classes of structures which can be characterized in L,,,~, 
but not in first-order logic. Examples are: 


the class of torsion groups, characterized by the conjunction of the group 
axioms and 
Vx \/{xo-+-ox =eln> Lh, 
Vi | } 
n-times 


the class of fields with characteristic a prime, by the conjunction of the field 
axioms and 


I . = 
Vil + s + 1 = Ojn prime), 
n-times 


the class of archimedean ordered fields, by the conjunction of the axioms for 
ordered fields and 


Vx Vix<1lt---+ 1 [n> 1, 


n-times 
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the class of structures isomorphic to (N, a, 0), by the conjunction of the first 
two Peano axioms and 


Vx \/{x =g---g O|n> 0}. 


n-times 


2.2 Remarks. 
(a) For a set ® which is at most countable let /A® be an abbreviation for 
the Ly,.-formula 7\/{7@|¢ € 9}. Then 


3 A® iff forall ge®, JE ~@. 
/\® is called the conjunction of the formulas in ©®. 


(b) The definition of the set SF(@) of subformulas of a formula ¢ in L,,, 
is obtained from the corresponding definition for first-order formulas in 
11.4.5 by adding the clause SF(\/®) = {\/®} U wee SF($). It can be 
proved for arbitrary @ that SF(@) is at most countable. The proof is by 
induction on formulas; we give the \/-step: Let pe = \/®, where by induction 
hypothesis SF(+) is at most countable for every ¥ € ®. Since SF(g) = {g} u 
ees SF(+) is an at most countable union of at most countable sets, SF(@) 
is at most countable. In particular, for every g € L§,,, there exists an at most 
countable S' < § such that ge LS. 


(c) Define the set free(\/®) of the variables occurring free in \/® to be 
ye free(y). The formula \/{en = ¥,|n€N} has infinitely many free 
variables. But one can easily prove by induction that in case free(@) is finite 
then so is free(+) for any subformula wy of @. In particular, subformulas of 
an £4,.-sentence have only finitely many free variables. 


Consider the L® .,-sentence 
Wein = \/{1@z,|n = 2}. 
Then for every structure %{f we have 
We Wi, iff A is finite. 
Hence the set of sentences {Wrin} U {p,,|n = 2) isan example showing 
2.3 Theorem. The compactness theorem does not holdfor 29 ,.: [es] 
Nevertheless, many results for Y have their counterparts in %,,,4. We 


mention some examples and refer the reader to [20] for more information. 


(1) The analogue of the Lowenheim-Skolem theorem holds (see 2.4 below). 
(2) Extend the sequent calculus for first-order logic by the following "rules" 
for \/: 


r 
(\VA) Wey (\/) se" ge: 
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where [I stands for a finite sequence of L,,,-formulas. In this way one 
obtains a correct and complete "calculus": for %,,,,-sentences @o, 
Pi,+-+5Q@n—5 and @, the sequent @o, P,... Gy-1@ 1S derivable if and 
only if it is correct. However, one must allow infinitely long derivations as 
is obvious from (\/A). 

(3) An analysis of (2) shows that by suitably generalizing the concept of 
finiteness one can transfer other results from Y, to #,,,,.. Among these 
is the Barwise compactness theorem, cf. [1]. 


2.4 Léwenheim-Skolem Theorem for /,,,.,. Every satisfiable 2 ,,,,-sentence 
has a model over an at most countable domain. 


Since for every &,,,.-sentence @ there is an at most countable S such that 
QE bee 2.4 follows directly from 


2.5 Lemma. Let S be at most countable, @ € ‘beer and let 8 be an S-structure 
satisfying @. Then there is an at most countable substructure 2l < 8 such that 
WE oe. 


PROOF. We first present the idea of the proof. 

Let By be a nonempty at most countable subset of B which is S-closed, 
that is, which contains all c® for c € S and which is closed under application 
of f® for f € S. Then Bo is the domain of an at most countable substructure 
B, of B. If one tries to prove by induction that Bo = ¢, the proof breaks 
down at the point where 3-quantifiers are considered. For example, in a 
simple case where @ is of the form 4x Px, one must ensure that there is a 
be By such that P®b. Therefore we shall close By with respect to all possible 
existential requirements arising from subformulas of @. 

We now begin the proof. For pairwise distinct variables X9,..., X,»— 5 we 
write Wi(Xg,-.+5X,—1) to denote a formula wy with free($) < {Xxg,...,Xn—a}- 
Define a sequence A,, A,, A,, ... of at most countable subsets of B so that 


(a) An © Amst; 
(b) for n-aryf €S anda,, ...,Q,—5 € Ams f (dos +++54n-1)€ Amos 
(c) for W(xo,..., X,) € SF(g) and a,, ...,a,-, EA, if 


B = dx, WLao, Sees a,-1] 
then there is an a, € A,,,, such that B & W[dg,.. +5 Gn—1> Bl 


Let A, be anonempty at most countable subset of B which contains {c®|ceS}. 
Suppose A, is already defined and is at most countable. In order to define 
A, +, we first set 


A} f agseigG,2) ln = Tt eS Meaty 74,5 05.7 © A): 


Aj, is also at most countable. Now, for W(xo,..-,X,) € SF(@p) and do,..., 
d,-, A, such that Bf 4x,Wf[ag,...,a,—,], we choose an element be B 
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such that 8 & wWlao,...,a,—,, b]. Let Ay, be the set of b's chosen in this 
way. Since SF(@~) and A, are at most countable, so is Aj,. Hence Ay,.1 27 
A, u A; uA isalso at most countable, and (a)-(~are satisfied. 
Now let 
A:= J A. 


me 


Then 


(1) A is at most countable. 
(2) A is S-closed. (By choice of A,, we need only show that A is closed under 


the functions f®. Let f € S be n-ary and a,, ...,a,—, € A. Since the sets 
A, form an ascending chain, a,, ...,a,-, lie in some A,. By (b) the 
element f*(ao,...54@,—1) lies in Ay, and hence also in A.) 


By (1) and (2), A is the domain of an at most countable substructure YI 
of 8. Therefore we are done if we can show: 


(*) WE oe. 

(*) follows immediately from the following claim: 

(x*} For all (xo, ..-,;X_,—-1) € SF(@) and alla,, ...,a,-, 6A, 
We Wlao,...,4n-1] iff BE Wlao,..., 4-1]. 


We prove (**) by induction on y, but limit ourselves to the J-case. 
Let W(Xo,..+5Xn—s) = AX, Y(Xo,.--, X,), and suppose a,, ...,4,-1€ A. 
If UW Ax, xLao,..., 4,1} then we obtain successively: 
There is a € A such that WE y[ao,.-., 4-1) al. 
There is a € A such that 8 f= y[ao,...,4@,-1,a] (Ind. hyp.). 
Be Ax, 7~[d0,...,a a]. 


Conversely, if B & 4x, 7[@o,...,4,-,], we choose k such that a,, ..., 
a,— © A,, and we obtain successively: 


There is a € A, 4, such that B & 7[ao,---,4,-1,a] (by (c)). 
There isa € A,,, such that We y[do,..-,4,-,,a] (Ind. hyp.). 
We Ix.xLdo,..-, 4n-1]. O 


Consider an at most countable set @ of first-order sentences and let 
g:= /\®. Then it follows from 2.5 that every model of ® has an at most 
countable substructure which is also a model of ®. In particular, this yields 
a proof of the Lowenheim-Skolem theorem for first-order logic which does 
not rely on the proof of the completeness theorem. Note that an Yu, o> 
sentence characterizing (N, a,Q) has no uncountable model; hence in 


L,0 We do not have an analogue of the upward Lowenheim-Skolem 
theorem VI.2.3. 
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To conclude this section we give a mathematical application of 2.5 by 
choosing @ appropriately. 

We consider groups as S-structures with S = {o,e, ~*}. A group & is 
said to be simple if {e°) and G are the only normal subgroups of &. If for 
aé G we denote by <a>® the normal subgroup of & generated by a, then 
clearly © is simple if and only if <a>® = G for all ae€ G such that a # e®. 

Since 


C@)— = {90a7go'...g,a7"g, 1|NEN, Z9,.--, Zn € Zy GJos-+-s In € Gh, 


the class of simple groups can be axiomatized in L§.,, by the conjunction 


@, of the group axioms and the following sentence: 


Vx(1x =e > Vy \/{d09... 0, {Ly S v9 x09)... vg x0, *| 
Zo,-++,2Z, EZ} (NEN). 


Using 2.5 we now show 


2.6 Proposition. [f © is a simple group and M a countable subset of G then 
there is a countable simple subgroup of © which contains M. 


Proor. Let S:=S u_{c,|a¢ M), where the c, are new constants for a € M. 
We expand & to an S-structure &, interpreting each c, by the corresponding 
a, and apply 2.5 to G and @p. LI 


2.7 Exercise. Show that for every Ly*-sentence @ (cf. exercise 1.7) there is 
an LS .,-sentence I) with the same models (that is, for all S-structures QI, 
We, og iff WED). Conclude from this that the Lowenheim-Skolem 
theorem holds for #77. 


2.8 Exercise. Show that the following classes can be axiomatized by an 
L yo7Sentence: 


(a) the class of finitely generated groups; 
(b) the class of structures isomorphic to (Z, <). 


2.9 Exercise. (a) For arbitrary S, show that L%,,., is uncountable. 
(b) Give an uncountable structure 8 (for a suitable countable symbol set S) 
such that there is no countable structure QI satisfying the same L5,,,- 


sentences. 


2.10 Exercise. Using 2.5, show that any two infinite @-structures satisfy 
the same #2 .,-sentences. 


IX. Extensions of First-Order Logic 


§3. The System “g 


The system #Q is obtained from first-order logic by adding the quantifier 
Q, where a formula Qxep says "there are uncountably many x satisfying 9”. 


3.1 Definition of “gy. Compared with the first-order language L*, we add 
the following to constitute the language Lg: 


(a) the symbol Q; 
(b) to the calculus of formulas, the rule: 


If g is an S-formula then so is Qxcp; 


(c) to the definition of the notion of satisfaction, the clause: 
If g is an S-formula and 3 = (YI, £) an S-interpretation then 


a 
3 Oxo iff ‘3 EA|S3 ir of is uncountable. 


L g has more expressive power than 9 , . For example, the class of at most 
countable structures can be axiomatized in &g by the sentence 7Qx x = x. 
For S = {<) let @o be the conjunction of the axioms for orderings and 
(Qx x = x A Vx TQy y < x). Then @p is an L$-sentence characterizing the 
class of uncountable orderings in which every element has at most countably 
many predecessors. These so-called w,-like orderings play an important 
réle in investigations of Ly. 

Note that the sentence @g, or even the sentence Qx x = x, has an un- 
countable, but no at most countable model. Hence the strict analogue of the 
Lowenheim-Skolem theorem does not hold. (However, each satisfiable 
# g-sentence has a model of cardinality <%,, cf. exercise 3.3.) 

One can set up an adequate sequent calculus Sg for #g by adding the 
following rules to the sequent calculus for first-order logic. (After each rule 
an explanatory comment is given, which is also the essence of a correctness 
proof.) 


I Ox@ 


y ° 
lr Oy — 
xX 


if y is not free in g. 


(Renaming of bound variables); 


4 Ox(x = y vx =z) if y and z are distinct from x. 


("Singletons and pair sets are not uncountable"); 


I- Vx(o > W) 
IP Qxcp + Oxy 
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("Sets having uncountable subsets are uncountable"'); 


. 7AQxsye 
Yr Qyixe 
Tl AxOyp 


("If the union of at most countably many sets is uncountable then at least 
one of these sets is uncountable"). 

One can show (cf. [19]}) that the calculus Sg is correct and complete in 
the sense that the equivalence “® — @ iff M |- »” holds for Yg if D is at 
most countable. As for first-order logic we conclude (cf. VI.2): 


3.2 &g-Compactness Theorem. For every countable set D of L}-formulas, D 
is satisjiable if and only if every finite subset of ® is satisfiable. 


The following example shows that the compactness theorem does not 
hold for uncountable sets of formulas. Let S be an uncountable set of con- 
stants and let 


®:= {nc =d\c,deS,c#d} VU {70xx = xt. 


Then every finite subset of ® is satisfiable, but ® itself is not. 


In Chapter VI we saw that the compactness theorem and the Lowenheim— 
Skolem theorem are useful for mathematical applications. None of the 
extensions of ¥, which we have discussed in this section satisfies both 
theorems. The compactness theorem fails for 2 ,,,,,, the Lowenheim-Skolem 
theorem for “gg, and both for ¥;. Does there exist any logical system at all 
which has more expressive power than first-order logic and for which both 
the compactness theorem and the Lowenheim-Skolem theorem hold? We 
shall give a negative answer to this question in Chapter XII. 


3.3 Exercise. Show that every satisfiable 9,-sentence has a model over a 
domain of cardinality at most &, (where &, is the smallest uncountable 
cardinal). (Hint: Use a method similar to that in the proof of 2.5: for formulas 
Qxe, which hold in 8, add &, elements satisfying ¢.) 


34 Exercise. Let #% be obtained from #g by changing the notion of 
satisfaction 3.1(c) as follows: 


3 Ox iff ‘2 E A|S — of is at most countable. 


Show that the compactness theorem does not hold for #9, but that the 
Lowenheim-Skolem theorem does. 


CHAPTER X 
Limitations of the Formal Method 


Only in methodological questions have we thus far referred to the fact that 
applications of sequent rules consist ultimately of mechanical operations on 
symbol strings (cf. VII.1). In the following we also want to make stronger 
use of this formal-syntactic aspect in mathematical considerations. Let us 
give an initial idea, taking as an example the system of axioms ®,, = 
{@o, 1, @2} for group theory. It follows from the completeness theorem 
that for all S,,-sentences ¢, 


O,- go iff Ob 9g. 
Thus @ is a theorem of group theory 
Th,, ae {y € Lee"|®,, E Wt, 


if and only if the sequent @9@,@,@ Is derivable. 

By systematically applying all the sequent rules one can generate all 
possible derivations and thus compile a list of the theorems of Th,,: One 
adds a sentence g € L}* to the list if one arrives at a derivation whose last 
sequent is @9@;@,@. Hence there is a procedure by which one can in a 
"mechanical" way list all theorems of Th,,. It should be plausible that one 
could use a suitably programmed computer to carry out such a procedure. 
Of course, one would have to be able to increase the capacity of the computer 
if necessary since the derivations and the sequents and formulas therein 
can be arbitrarily long. A set such as Th,, which can be listed by means of 
such a procedure is said to be enumerable. 

Of course, the above enumeration procedure yields many trivialities like 
Vx(x = x > x = x). On the other hand, a group theorist is only interested 
in specific group-theoretical statements @ which are relevant for his in- 
vestigations. His aim is to determine for such a @ whether @ € Th,, or not. 
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Usually this is accomplished either by a proof or by a counterexample. 
Unfortunately, an enumeration procedure for Th,, as above is of little help 
in this situation: Given @, one might start the procedure in order to see 
whether @ appears as an output; however, if @ ¢ Th,, the procedure will not 
yield this information since at any step one is left uncertain as to whether 
@ will appear later or will not appear at all. Thus we are led to seek a different 
kind of procedure which can be applied to an arbitrary S,,-sentence @ and 
then stops after finitely many steps, yielding the decision whether ¢ € Th,, 
or not. Put another way, can one program a computer so that whenever it 
is given an S,,-sentence ¢ it "computes" whether ¢ belongs to Th,,? If such 
a procedure exists for a given theory, we call that theory decidable. 

The present chapter is devoted to questions of this kind. First we discuss 
the concepts of enumerability and decidability in more detail, in §1 from a 
naive point of view, and in §2 on the basis of the precise notion of register 
machine. These topics form part of the so-called recursion theory (theory of 
computability). The remaining sections of the chapter then contain applica- 
tions to first-order and second-order logic. 

For further information about recursion theory we refer the reader to 
[6] and [15]. 


1 Decidability and Enumerability 


A. Procedures, Decidability 


It is well known how to decide whether an arbitrary natural number n is 
prime: If n = 0 or n = 1, n is not prime. If n => 2, one tests the numbers 
2,3,...,” — 1 to see whether they divide n. If none of these numbers divides 
n then 7 is prime; otherwise it is not. 

This procedure operates with strings of symbols. For example, in the 
case of decimal representation of natural numbers it operates with strings 
over the alphabet {0,..., 9}. Our description has not specified it in complete 
detail—for instance, we have not described how division is to be carried 
out— but it should be clear that it is possible to fill these gaps in order to 
ensure that all steps are completely determined. In view of its purpose we 
call the procedure a decision procedure for the set of primes. 

Other procedures which are well known include those for 


(a) multiplying two natural numbers, 
(b) computing the square root of a natural number, 
(c) listing the primes in increasing order. 


Common to all of these procedures is the fact that they proceed step by 
step, they operate on symbol strings of a well-defined sort, and they can be 
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carried out by a suitably programmed computer. A procedure can operate 
On one or more inputs (as in (a) or (b)) or it can be started without any 
particular input (as in (c)). It can stop after finitely many steps and yield an 
output (as in (a) for any input and in (b) for inputs which are squares), or it 
can run without ever stopping, possibly giving an output from time to 
time (as in (c)). 

Procedures in our sense (effective procedures, processes, algorithms) 
operate with concrete objects such as symbol strings. Sometimes mathe- 
maticians use these notions in a wider sense, speaking, for instance, of the 
Gram-Schmidt orthogonalization process even when referring to abstract 
vector spaces. 

Concerning the following definition and the subsequent discussion the 
reader should bear in mind that the notion of procedure has so far been 
introduced just in an intuitive way and by means of examples. 


1.1 Definition. Let A be an alphabet, W a set of words over A,1.e., W < A*, 
and ‘88 a procedure. 


(a) ‘B is a decision procedure for W if, for every input ¢ € A*, $$ eventually 
stops, having previously given exactly one output 7 € A*, where 


n=U), iffew, 
n#O, iffew. 
(b) W is decidable if there is a decision procedure for W. 


Thus when a decision procedure for W is applied to an arbitrary word ¢ 
over A, it yields an answer to the question “¢ € W?" in finitely many steps. 
The answer is "yes" if the output is the empty word; it is "no" if the output 
is a nonempty word. 

To formulate the above decision procedure for the set of primes according 
to definition 1.1 we set A :={0,...,9) and W -=the set of primes, and we 
agree that the empty word shall be the output for primes and, say, 1 the 
output for nonprimes. 

Further examples of decidable sets are the set of terms and the set of 
formulas for a concretely given symbol set. In the case of S, (cf. 11.2) for 
instance, terms and formulas are strings over the alphabet 


Aig t= os tive eg Ns Vira: = Ih OSs 


We sketch a decision procedure for the terms. 

Let ¢ € A* be given. First determine the length /(¢). If (f) = 0,¢ is not a 
term. If (€) = 1, ¢ is a term if and only if ¢ is a variable or a constant. If 
©) > 1, € is not a term unless it begins with a function symbol. If ¢ begins 
with a function symbol, say € = f?<', then check whether there is a de- 
composition ¢° = €9¢,¢,, where the ¢; are terms. ¢ is a term if and only if 
such a decomposition exists. To check whether each €; is a term, use the 
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same procedure as for ¢. (Clearly, in this way an answer will be obtained 
after finitely many steps.) 

If one analyzes the procedure or tries to program it for a computer, a 
difficulty arises: programs (or descriptions of procedures) are finite and 
therefore can only refer to finitely many symbols in A,, whereas A, con- 
tains, among other things, the infinite list of symbols v9,v;,03,... . Therefore 
we introduce the new finite alphabet 


Ag h00) 105259 0.1, ke Ved, =,),GR fic} 


and then represent the symbols in A, in terms of the symbols of Ag in the 
natural way. For example, we represent v7, by vZ1, c,, by cll, Rig by R318 
and the S,,-formula Jv3(Riv3 v c,, = fdv,) by Ww3(R11v3 v cll = f10vl). 
With this in mind we only consider finite alphabets in the sequel. 


1.2 Exercise. Let A be an alphabet, and let W, W' be decidable subsets of 
A*. Show that W u W', Wm Wand A* — W are also decidable. 


1.3 Exercise. Describe decision procedures for the following subsets 
of Ag: 


(a) the set of strings xg over A, such that x € free(), 
(b) the set of S,-sentences. 


B. Enumerability 


We consider a computing machine which operates as follows: it successively 
generates the numbers Q, 1, 2, ..., tests in each case whether n is a prime, 
and yields n as output if the answer is positive. The machine runs without 
ever stopping, and it generates a list of all primes, 1-e., a list in which every 
prime eventually appears. 

Sets, such as the primes, which can be listed by means of a procedure are 
said to be enumerable: 


1.4 Definition. Let A be an alphabet, W < A* and SB a procedure. 


(a) 88 is an enumeration procedure for W if ‘8, once having been started, 
eventually yields as outputs exactly the words in W (in some order, 
possibly with repetitions). 

(b) W is enumerable if there is an enumeration procedure for W. 


We give some further examples for enumerable sets. 


1.5 Proposition. F A is a (finite) alphabet, then A* is enumerable. 
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PRooF. Suppose A = {a,, ...,a,). We first define the lexicographic order on 
A* (with respect to the indexing a,, ...,a,). In this ordering ¢ precedes ¢’ 
if either 

Kg) < KC’) 


Kc) =e’) and “¢ precedes ¢" in a dictionary ”, 


that is. there are a;,a;¢ A, with i <j, such that for suitable ¢, 4, 7’ € A*, 

¢ = Cain and ¢ = Cajn’. 

For example, if A = {a,b,c,...,x,y, Zz}, then "papa" comes before 
“papi”, but after ''zoo”. In general the ordering begins as follows: 


LN. dy, 22250, Ap Ap, 4944, 2.2, Ag Aq, A105 2. .5 A, A_, ApApag,.... 


It is easy to set up a procedure which lists the elements of A* in lexicographic 
order. 


1.6 Proposition. {g € Lg”| — @} is enumerable. 


PROOF. By the completeness theorem we have to describe a procedure which 
lists the S,,-sentences @ with t- g. We use the same idea as in the procedure 
for listing Th,, at the beginning of this chapter: We systematically generate 
all possible derivations for the symbol set S,. If the last sequent in such a 
derivation consists of a single sentence @, we include @ in the list. Note that 
the derivations can be generated as follows: Forn = 1,2,3,...oneconstructs 
the first n terms and formulas in the lexicographical ordering, and one 
forms the finitely many derivations of length <n which use only these 
formulas and terms, and which consist of sequents containing at most n 
members. LJ 


C. The Relationship Between Enumerability and 
Decidability 


We have just seen that the set of "logically true" sentences can be listed by 
means of an enumeration procedure. Is it possible to go farther than this 
and decide whether an arbitrary given sentence is "logically true"? The 
enumeration procedure given above does not help to solve this problem. 
For example, if we want to test a sentence @p for validity we might start the 
enumeration procedure in 1.6 and wait to see whether @, appears; we obtain 
a positive decision as soon aS @p is added to the list. But as long as @o has 
not appeared we cannot say anything about @p» since we do not know 
whether @, will never appear (because it is not valid) or whether it will 
appear at a later time. In fact we shall show (cf. 4.1) that the set of valid 
S,,-sentences is not decidable. 
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On the other hand, if a set is decidable we can conclude that it is 
enumerable: 


1.7 Theorem. Every decidable set is enumerable. 


PROOF. Suppose W < A* is decidable and ‘$ is a decision procedure for 
W. To list W, generate the strings of A* in lexicographic order, use ‘B to 
check for each string € whether it belongs to W or not, and, if the answer is 
positive, add ¢ to the list. LI 


As an extension of 1.7 we have: 


1.8 Theorem. A subset W of A* is decidable if and only if Wand the comple- 
ment A* — W are enumerable. 


PROOF. Suppose W is decidable. Then A* — W is also decidable (one can 
use a decision procedure for W, merely interchanging the outputs "yes" 
and "'no"'). Thus by 1.7, W and A* — W are enumerable. Conversely, 
suppose W and A* — W are enumerable by means of procedures ‘$ and 
$8’. We combine $ and SB’ into a decision procedure for W which operates 
as follows: Given ¢, 3 and 9%’ run simultaneously until ¢ is yielded by either 
$8 or $B’. This will eventually be the case since every symbol string in A* is 
either in W or in A* — W.If € is listed by 8, the output is "yes" (€ € W), if 
it is listed by ‘B’, the output is "no" (¢¢ W). LO 


1.9 Exercise. Suppose U ¢ A* is decidable and W < U. Show that if W 
and U — W are enumerable, then W is decidable. 


Our definitions of decidability and enumerability were given with respect 
to a fixed alphabet. However, this reference is not essential: 


1.10 Exercise. Let A, and A, be alphabets such that Ay © A,, and suppose 
W c Af. Show that W is decidable (enumerable) with respect to A, if and 
only if it is decidable (enumerable) with respect to A,. 


D. Computable Functions 


Let A and B be alphabets. A procedure which for each input from A* yields a 
word in B* determines a function from A* to B*. A function whose values 
can be computed in this way by a procedure is said to be computable. An 
example of a computable function is the length function /, which assigns to 
every ¢ € A* the length of ¢ (in decimal notation as a word over the alphabet 
{0, ..., 9}). 

Whereas our discussion of effective procedures deals mainly with the 
notions of enumerability and decidability, many presentations of recursion 
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theory start with computability of functions as the key concept. Both 
approaches are equivalent in the sense that the above notions are definable 
from each other. The following exercise shows that the notion of computable 
function can be reduced to both the notion of enumerability and the notion 
of decidability. 


1.11 Exercise. Let A, B be alphabets, # ¢ A u B, and f: A* > B*. Show 
that the following are equivalent: 


(i) f is computable. 
(ii) {w # £(w)|we A*} is enumerable. 
(iii) {w # f(w)|w © A*} is decidable. 


{w # {(w)|we A*} can be considered as the graph of f, and hence the 
equivalences in 1.11 can be formulated as follows. A function is computable 
iff its graph is enumerable (decidable). Note that by 1.8 the notion of decid- 
ability can be reduced to that of enumerability. 


§2. Register Machines 


In the foregoing discussion we have used an intuitive notion of procedure 
which we illustrated by use of examples. The conception we have thus 
acquired is perhaps sufficient for recognizing in a given case whether a 
proposed procedure can be accepted as such. But in general, our informal 
concept does not enable us to prove that a particular set is not decidable. 
Namely, in this case one must show that every possible procedure is not a 
decision procedure for the set in question. But such a proof is usually not 
possible without a precise notion of procedure. 

We now introduce such a precise concept, starting from the idea that a 
procedure should be programmable on a computer. For this purpose we set 
up a programming language and define procedures in the formal sense to be 
exactly those procedures which can be programmed in this language. 

For the following discussion we fix an alphabet 


A= Lips sxe Ay) 


The programs are executed by computers with a memory consisting of 
units Ro,...,R,, called registers. (In the literature such machines are 
frequently called register machines.) At each stage in a computation every 
register contains a word from A*. We assume that we have machines with 
arbitrarily many registers at our disposal, and that the individual registers 
can store words of arbitrary length. This idealization agrees with our objec- 
tive of encompassing all procedures which can be carried out in principle by 
a computer, 1.e., disregarding problems of capacity. 
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A program (over A = {a,, ...,a,)) consists of instructions, where each 
instruction begins with a natural number L, its label. Only instructions of the 
form (1) through (5) below are permitted. (The instructions describe very 
elementary operations on registers; for instance, they will add or delete a 
symbol in a register, or will test whether a register contains the empty word 
or not. Together with the instructions we give their precise meaning in 
parentheses.) 


for L, i, j¢ N with j < r (Add-instruction: "Add the letter a; to the word 
in register R,"'); 


(2) L LET R; =R,; — 4, 
for L, i, j3¢ N with j < r (Subtract-instruction: "If the word in register 
R; ends with the letter a;, delete this a,;; otherwise leave the word un- 
changed”); 


(3) LIF R,;=UJ THEN L’' ELSE Lg OR...OR L, 
for L, i, L', Lg,..., L,¢ N (Jump-instruction: "If register R; contains 
the empty word go to instruction labelled L'; if the word in register R; 


ends with a, (resp. a;,...,a,) go to instruction labelled Ly» (resp. 
Ly,...,L,)"); 


(4) L PRINT 
for Le N (Print-instruction: "Print as output the word stored in 
register Ry”); 


(5) L HALT 
for L € N (Halt-instruction: "Halt"). 


2.1 Definition. A register program (or simply program) is a finite sequence 
a,, ..., a, Of instructions of the form (1) through (5) with the following 
properties: 


(i) a; has label iGi = 0,...,k). 
(ii) In an instruction of the form (3) the labels L', Lo,..., L, are <k. 
(ii) Only a, is a halt-instruction. 


Each program P gives rise to a procedure: Imagine we have a computer 
which contains all registers occurring in P and which has been programmed 
with P. At the beginning of a computation all registers with the possible 
exception of Ry are empty, 1.e., they contain the empty word, whereas Ro 
contains a possible input. The computation proceeds stepwise, each step 
corresponding to the execution of one instruction of the program. Beginning 
with the first instruction one proceeds line by line through the program, 
jumping only as required by a jump-instruction. Whenever a print-instruc- 
tion is encountered, the respective content of Rp is given as an output 
("printed out"). The machine stops when the halt-instruction is reached. 
Some examples of programs follow. 


152 X. Limitations of the Formal Method 


2.2 Example. Let A = {|}. We interpret the strings UJ, |, ||,... as the natural 
numbers Q, 1, 2,.... The following program P, decides whether an input in 
the register Rg is an even number or not: Pg successively deletes strokes from 
the string n given as an input in Rg until the empty string is obtained. It 
ascertains whether n is even or odd and prints out LJ or | accordingly and 
then stops. 


0 IF Rj =O THEN 6 ELSE 1 
1 LET R,=R,-| 

2 IF R, = THEN 5 ELSE 3 
3 LETR,=R,-| 

4 IF R,=( THEN 6 ELSE 1 
5 LET R,=Ry +1 

6 PRINT 

7 HALT. 


We say that a program P is started with a word ¢e€ A%*, if P begins the 
computation with € in Ry and L] in the remaining registers. If P, started with 
¢, eventually reaches the halt-instruction, we write 


P:¢ > halt; 
otherwise we write 
Pit = oo: 
For ¢,n e€ A*, 
P:{—>%n 


means that P started with ¢ eventually stops, having—in the course of the 
computation— given exactly one output, namely y. In the above example, 


Po:n—->U, if nis even, 


Po:n—|,  ifnisodd. 


2.3 Example. Let A = {a,, ...,a,}. For the program P: 


0 PRINT 

1 LET Ryo =R,+ta, 

2 IF R,»=(C THEN 0 ELSE 0 OR...OR 0 
3 HALT 


we have P: ¢ > oo for all ¢. If P is started with a word ¢, P prints out succes- 
sively the words ¢, Cag, €€g do, ... 
Line 2 of P has the form 


L IF Ryo = LJ THEN L' ELSE L' OR...OR L. 


In every case such an instruction results in a jump to line L’. For the sake of 
simplicity we shall in the sequel abbreviate it by 


L GOTO L’. 
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24 Example. We present a program P for the alphabet A = {a,, a,), such 
that P: ¢ > ¢¢ for ¢ € A*. (Given ¢ in Rg, instructions 0-8 serve to build up 
¢ in reverse order in the registers R, and R2, after which €¢ is built up in Ro, 
taking the first copy from R, (instructions 9-15) and the second copy from 


R, (instructions 16-22).) 
IF R, = LJ THEN 9 ELSE 1 OR 5 


bho = SS SS 
NS SRRAUABRODHKOUOMIDUNAWNHH SOS 


SB 


As 


LET Ry = Ro — ao 
LET R, =R, + ag 
LET R,=R,; #a, 
GOTO 0 

LET Ry =Ro - a, 
LET R,=R, ta, 
LET R, = R, + a, 
GOTO 0 

IF R, = 0 THEN 
LET R, =R; ~ a 
LET Ry = Ry # a 
GOTO 9 

LET R, =R, -4; 
LET Ro — Ro + a, 
GOTO 9 

IF R, =O THEN 
LET R, = R, - a, 
LET R, =Ro ta, 
GOTO 16 


_LET R,=R,-a, 


LET R, =Ro ta, 
GOTO 16 
HALT. 


P:¢ — halt, 
P:6> %, 


16 ELSE 10 OR 13 


23 ELSE 17 OR 20 


an exercise the reader should write a program P over the alphabet 
A = {a,, a,, a,) which accomplishes the following: 


By analogy with the naive definitions in §1, we can introduce the exact 
notions of register-decidability and register-enumerability. 


2.5 Definition. Suppose W < A*. 


(a) Aprogram P decides W if for all ¢ € A*, 
P:¢>U, if 


P:€>ynwithn 4 UO, 


if € = dyaay, 


if € # ap doa). 


if C ¢ W. 
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(b) W is said to be register-decidable (abbreviated: R-decidable) if there is a 
program which decides W. 


Example 2.2 shows that the set of even natural numbers is R-decidable. 


2.6 Definition. Let W < A*. 


(a) A program P enumerates W if P, started with LJ, prints out exactly the 
words in W (in any order, possibly with repetitions). 

(b) W is said to be register-enumerable (abbreviated: R-enumerable), if there 
is a program which enumerates W. 


If P enumerates an infinite set, then P:q—-o. By 2.33, W= 
(QU, a,, @9a,...} is R-enumerable. The program 0 HALT enumerates the 
empty set, as does the program 


0 LETR,=R, ta 
1 GOTO 0 
2 HALT. 


For the sake of completeness we add the definition of register-computable 
functions. 


2.7 Definition. Let A and B be alphabets and F: A* > B*. 
(a) A program Pover A vw Bcomputes F if for all ¢ € A: 


P:0—> F(6). 


(b) F is said to be register-computable (abbreviated: R-computable) if there 
is a program over A VU B which computes F. 


In this terminology, program P of 2.4 computes the function 
F: {a,, @,}* > {a,, a,}* with F(Q) = Cf. Definitions 2.5 through 2.7 can 
easily be extended to n-ary relations and functions. For example, in order 
to use a program to compute a binary function, one enters the two arguments 
in the first two registers. 

Since any program describes a procedure it is clear that every R-decidable 
set is decidable, every R-enumerable set is enumerable, and every R-com- 
putable function is computable, Does the converse also hold? In other words, 
can every procedure in the intuitive sense be simulated by means of a 
program? A mathematical treatment of this problem is not possible because 
the concept of procedure is an intuitive one without an exact definition. 
Nevertheless, in spite of the simple form of the instructions allowed in 
register programs, it is widely accepted today that all procedures can indeed 
be simulated by register programs, and consequently that the intuitive 
concepts of decidability and enumerability coincide with their mathe- 
matically precise R-analogues. This view was first expressed by A. Church 
in 1936 (referring to a different but equivalent precise notion of decidability 


and eannmerahilitus Thareafare tho elaim that avariw nenradiien nan ha 
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simulated by a program and hence that the concepts of enumerability and 
decidability coincide with their precise counterparts is called Church’s Thesis. 
We mention two arguments which support this thesis. 


ARGUMENT |: Experience. Hitherto it has always been possible to simulate 
any given procedure by a register program. In particular, programs in 
programming languages such as ALGOL or FORTRAN can be rewritten 
as register programs. 


ARGUMENT 2. Since 1930 numerous mathematical concepts have been pro- 
posed as precise counterparts to the notion of procedure. Although developed 
from different starting points, all of these definitions have turned out to be 
equivalent. In the literature R-decidable sets and R-computable functions 
are frequently called recursive, and R-enumerable sets are called recursively 
enumerable. 


Proofs of R-enumerability or R-decidability often require a considerable 
amount of programming work. To avoid getting lost in details, rather than 
actually writing down register programs, we shall usually content ourselves 
with describing procedures intuitively. The following example should help 
to illustrate this. 


2.8 Example. The set of valid S,,-sentences is R-enumerable. 
As proof we accept the procedure described in 1.6. C] 


In the following exercises the critical reader is invited to practice writing 
programs for given procedures. The more trusting reader may instead draw 
upon the experience of others and rely on Church's thesis. 


2.9 Exercise. Suppose W, W' c A*.Show that if W and W' are R-decidable, 
then so are A*¥ — WLWoa W',and Wu W'.. 

2.10 Exercise. Suppose W < A*. Show: 

(a) A* is R-enumerable. 

(b) W is R-decidable if and only if W and A* — W are R-enumerable. 
2.11 Exercise. Suppose W ec A*. Show that (a) and (b) are equivalent. 


(a) W is R-enumerable. 
(b) There is a program P such that 


P:¢>-0, iff€e€W. 
P:€>+ x, iff ¢ W. 


2.12 Exercise. Show thataset W < A* is R-decidable iff W is R-enumerable 
in lexicographical order. 
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§3. The Halting Problem for Register Machines 


Again we fix an alphabet A = {a,, ..., a,). Our aim is to present a subset of 
A* which is not R-decidable. The set will consist of register programs (over 
A) which are suitably coded as words over A. 

For this purpose we associate with every program P (over A) a word 
fp € A*. First we extend A to an alphabet 5 


(4) B=AV{A,B,C,...,X, YZ} {0,1,...,8,9} vu {=, +, -, 0, |} 


and we order B* lexicographically according to the order of letters given in 
(+). We represent a program P as a word over B, e.g., the program 


O LET R,=R, -—4 
1 PRINT 
2 HALT 


is represented by the word 


OLETRI=RI—ag|1PRINT|2HALT. 


If this word is the nth word in the lexicographic ordering on B*, let 


Cpi= dg... ay. 
Nance annem 
n-times 
Set IT := {€p|P is a program over A). 

The transition from P to &> (1.e., the numbering of programs over A with 
words in {a)}*) is an example of a Gédei numbering (Gédel was the first to 
apply this method); and &p is called the Gédel number of P. 

Clearly, for each P we can effectively determine the corresponding 
Cp € A*; conversely, given € € A*, we can decide whether it belongs to [ 
or not, and if it does we can effectively determine the program P such that 
€p = ¢. The corresponding procedures can be programmed for register 
machines (cf. the discussion at the end of §2). In particular we have 


3.1 Lemma. IT is R-decidable. LI 
The following theorem presents first examples of R-undecidable sets. 


3.2 Theorem (Undecidability of the Halting Problem). (a) The set 
I;,,,, = {¢p|P isa program over A and P: Ep > halt) 


is not R-decidable. 
(b) The set 


Mian = {€p|P is a program over A and P: LJ - halt) 
is not R-decidable. 
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Part (b) says that there is no register program which decides the set 
{an Hence by Church's Thesis there is no procedure whatsoever which 
decides T,,,,,- From this we obtain the following formulation of 3.2(b): 


There is no procedure which decides for an arbitrarily given 
program P whether P: q — halt. 


For, if such a procedure ‘$ did exist, one could use it to decide [,,,,, as follows. 
First, for a given €, check whether ¢ € IT (cf. 3.1). If ¢¢ TI then ¢ € Myan- If 
¢ € II, construct the program P for which ¢p = ¢ and then apply B to P. 


PROOF OF 3.2. (a) For a contradiction, suppose that there is a program Po 
deciding IT},41- Then for all P: 


(1) Po: op > U, if P: €p > halt, 
Po: gp >for somen 4 UO, if Piopo a. 
From this we easily obtain a program P, (see below) such that 


Q) Py: bp x, if P: Ep > halt, 
P,:ép— halt, ifP:¢poa@. 


Then the following holds for all programs P: 

(3) Pi:ép> co iff P: Ep — halt. 
In particular, if we set P = P,, we have 

(4) Pi: &p, > & iff Py: Cp, = halt, 


a contradiction. 

To complete the proof of 3.2(a) we show how to construct P,; from Po: 
Wechange P, in sucha way that if Py prints theempty word, the new program 
P, will not reach the halt instruction. This is achieved by replacing the last 
instruction k HALT in Py by 


k IF Ro = UO THEN k ELSE kK +1 OR...OR k +1 
k+1 HALT 


and all instructions of the form L PRINT by L GOTO k. 
(b) To each program P we assign in an effective way a program P* such 
that 


(*) P: €p > halt iff P*: q — halt, 
* 
that iS, Ep € Tate iff Sp. € That: 


Using (*) we can prove (b) indirectly as follows: 


Suppose that I], is R-decidable, say by means of the program Py. Then in 
contradiction to (a), we obtain the following decision procedure for Ika: 
For an arbitrarily given € € A* first check whether ¢ € I (cf. 3.1). If ¢€T1 
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then ¢ ¢ 1,4. If ¢ ¢ H take the program P with Godel number 6, (i.e., with 
€p = Cand construct P*. Using Py, decide whether ¢p+ € T,,,,,- On account 
of (*) one thus obtains an answer to the question whether ép € Mhay, 1¢., 
whether € € [Thay,. 


It remains to define a program P* satisfying (*). If 


Cp = a, m8 a, 
ee ae 
n-times 


let P~ be the program which begins with the lines 


0 LETR,=Rota, 


n~l LET Rg =Ro ta, 


followed by the lines of P with all labels increased by n. When P* is started 
with LJ as input, it first builds up the word €pin Rp, and then operates in the 
saine way as the program P applied to €p. Hence (*) holds. LJ 


The reader should note that the only properties of the map Pt> &p used 
in the proof were its injectivity and properties of effectiveness as mentioned 
before 3.1. Therefore the undecidability of the halting problem does not 
depend on our particular choice of Godel numbering. 

Of course, for particular programs P it may be easy to determine whether 
P: () = halt or not. But theorem 3.2 tells us that there cannot exist a pro- 
cedure which decides this question "uniformly" for each P. (Strictly speaking, 
3.2 only refers to procedures which can be simulated by register programs. 
However, we obtain our preceding formulation if we accept Church's 
thesis. Henceforth we shall tacitly do this in explanatory remarks.) 

The following lemma together with 3.2 shows that IT.) is an example of 
an enumerable set which is not decidable. 


3.3, Lemma. [1,,,), is R-enumerable. 


PROOF. We sketch an enumeration procedure: for n = 1, 2, 3,... generate 
the finitely many programs whose Gédel numbers are of length In. Start 
each such program with LJ as input, and let each one perform n steps of its 
computation. To compile the desired list, note each program which stops. 


Applying 1.8, we obtain 
3.4 Corollary. A* — Wy, i not R-enumerable. C 


The proof of 3.2(a) is based on a so-called "diagonal argument". The 
following exercise contains an abstract version of this method of proof. 
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3.5 Exercise. (a) Suppose that M is a nonempty set and R <M x M. For 
a €M let 


M, = {be M|Rab}. 
Show that the set 
D = {b€ M {not Rbb) 


is different from each M,. 


(b) Let M = A* for some given alphabet A = {a,, ...,a,), and define 
RaoM x Mby 


Réy_ iff ¢ is the Godel number of a program P enumerating 
a set in which y occurs. 


Show that 
D = {n|not Ryn} 


is not enumerable. Thus the set of programs which do not print their own 
Gédel number is not enumerable. 
(c) Let M be as in (b) and define R © M x M by 


Réy iff € is not the Gédel number of a program which 
eventually stops when started with 7. 


Show that all R-decidable sets Cc A*) occur among the sets M; and that 
D= TTpait- 


§4. The Undecidability of First-Order Logic 


The set of valid first-order S,-sentences is enumerable (cf. 1.6). On the 
other hand we have: 


4.1 Theorem (Undecidability of First-Order Logic). The set {g € L3=|= o} 
d valid S,-sentences is not R-decidable. 


Thus there is no procedure which decides, for an arbitrary S,,-sentence, 
whether it is valid or not. 


ProoF. We adopt the notation of §3 with A = {|}. Again we identify words 
over A with natural numbers. By 3.2 we know that the set 


IInan = {Zp|P isa program over A and P: LJ > halt} 


is not R-decidable. We shall assign to every program P, in an effective way, 
an S,,-sentence @p such that 


(*) E@p iff P: Ul +halt. 
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Then we are done: If the set {p € L{”|E@} were decidable, we would 
have the following decision procedure for [1,,;, (a contradiction): Given 
¢ € A*, first check whether ¢ is of the form ép. If so, take P, construct @p 
and decide whether @p is valid. By (*) we obtain an answer to the question 
whether P: L] = halt, i.e., whether Ep € Mnyan- 

The following considerations are preparatory to the definition of the 
sentences @p. 

Let P bea program with instructions &,...,4,. Denote by n the smallest 
number such that the registers occurring in P are among Rg,..., R,. An 
(n + 2)-tuple (L, mg,...,m,) of natural numbers with L Ik is called a 
conjiguration of P. We say that (L,mp,...,m,) is the conjiguration d P 
after s steps if P, started with LJ, runs for at least s steps, and if after s steps 
instruction L is to be executed next, while the numbers my,...,m, are in 
Ro,.-., R,, respectively. In particular, (0,0,...,0) is the configuration of 
P after O steps (the "initial configuration ”). Since only «, is a halt-instruction 
we have 


(1) P:LJ— halt iff for suitable s,mo,..., m, 
(k, m9,..-,m,) 1s the configuration of P after s steps. 


If P: LJ — halt, we let sp be the number of steps carried out by P until it 
arrives at the halt-instruction. Finally we choose symbols R ((n + 3)-ary), 
< (binary), f (unary), and c from S,, (e.g., R3*?, Ro, fg, and co), and set 
S:={R, <, f,c). 

With the program P we associate an S-structure %p within which we shall 
describe how P operates. We distinguish two cases: 


Case 1, P: L1> x. We set Ap:=N and interpret < by the usual 
ordering on N, c by 0,f by the successor function, and R by the relation 
{(s, L. mo,..., m) |(L, Mo,..., m,) is the configuration of P after s steps), 
respectively. 


Case 2. P: 1] > halt. We set e :=max{k, sp} and Ap:= {0,...,e}, and 
interpret < by the usual ordering on Apand c by 0, respectively; furthermore 
we define f4? by f4°(m) = m + 1 for m < e andf **(e) = e, and set R*?:= 
{(s, L, Mo,...,7,)|C(L, Mo,-.-, m,) is the configuration of P after 5 steps). 
(Note that RP? is indeed a relation on A, since at each step P increases the 
contents of each register by at most 1, and hence we have mop,...,m, < 
Sp I e for all (s. L. my....,m,)€ R*?.) 

Now we provide an S-sentence {Wp which, in a suitable way, describes the 
operations of Pon L]. We abbreviate c,fc, ffc,... by 0, 1, 2,..., respectively. 
In reading Wp the reader should convince himself that the following holds: 


(2) (a) Up = Wp. 
(b) If Wis an S-structure with W = Wp and (L, mo,...,m,) is the con- 
figuration of P after s steps, then the elements 04, 14,..., §4 are 
pairwise distinct and We RsLmy...M,. 
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We set 
Wp = Wo N RO...0a Wa. A ee 2. Wie sas 


where Wo, Wa,..., Wax-1 Will now be defined. Wo says that < is an ordering 
whose first element is c, that x < fx holds for every x, and that fx is the 


immediate successor of x in case x is not the last element. 
Wo :-=—" < is an ordering" a Vx(e<xvcz=x) 
A Vx(x < fx v x = fx) Aa Vx(dyx < yo(x < fx 
AVW2ax<z3o(fx<zv fx = z)))). 


For & = &,..., %— 4, the sentence W, describes the operation corresponding 
to instruction «. It is defined as follows. 


Fora=L LET R;=R,+|: 
Wy = VX Vy9... Wy RX Lyo 2. Vy (x < fx 
A REx L + ly... Vina fWiVji41-- + Vad 
For « = L LET R,; =R, - | 
Wy i= Vx V9... Wy (Rx LYV9... y, @ (x < fix 


A((y; = 0 4 RfxL F yp... ¥,) v (Ty; = 9 
A du(fu = y; A RAXL F Lyo... Yi 141 -+- Yn). 


For x = L IF R; = LJ THEN LW’ ELSE Ly: 

Wy t= VX V9... Vy (Rx Lyo... y, 2 (x< fx 

A((y, = 04 RfXL’yy... y) v (Ty, = 0 4 REXLo Yo .-+ Y,)))). 

For « = L PRINT: 

Wa = VX Vy. Vy (RX Lyy ..- Vn OX < fx A RExL + Tyo... y,)). 

Now we set 
(3) Op:= Wp > Ax Jyg... Jy, Rxkyo... y,. 
Then @p is an S-sentence which satisfies (*), 1.e., 

E@p iff P: LI halt. 


Indeed, suppose first that @p is valid. Then in particular Wp E @p. Since 
Wp = We (cf. (2)(a)), we have Wp = Ax dy... dy, Rxkyo... yy (cf. (3)), ie, 
for suitable s, m9,..., m,, (k, Mg,---, m,) is the configuration of P after s 
steps. Now, (1) yields P: LJ — halt. 

Conversely, if P: L] + halt, then for suitable s, mg,...,m,, the tuple 
(k,mog,---,™,) 18 the configuration of P after s steps. Hence @p is valid, 
because if 2 is an S-structure such that Wf wp, then Wt Rskiig ... m, by 
(2)(b) and hence WE @p. 
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The undecidability of first-order logic was first proved by A. Church [5} 
in 1936. In traditional logic (Llull, Leibniz) the problem of finding a decision 
procedure for "logically true propositions" had already been considered. 
4.1 shows that such a search was bound to fail. 


4.2 Exercise. Prove (2)(b) by induction over s. 


4.3 Exercise. Show that the set of satisfiable S,,-sentences is not R- 
enumerable. 


§5. Trahtenbrot's Theorem and the 
Incompleteness of Second-Order Logic 


The object of this section is to prove that the set of valid second-order 
S,,-sentences is not enumerable, and to briefly discuss the methodological 
consequences. A useful tool in this context will be Trahtenbrot's theorem, 
which says that the set of sentences valid in all finite structures is not 
enumerable. 


5.1 Definition. (a) An S-sentence @ is said to be fin-satisfiable if there is a 
finite S-structure which satisfies @. 
(b) An S-sentence @ is said to be fin-valid if every finite S-structure satisfies . 


To be specific we consider the symbol set $,, and put 
®,, :-= {g € L§”|¢ fin-satisfiable), 
®,, = {@ € L3|@ fin-valid). 


As an example, we note that over a finite domain any injective function is 
also surjective; therefore the sentence gy :=QxV}(fx = fy—>-x=y)> 
Qxdy x =fy is fin-valid; however, ~ is not valid. The sentence ¢@ is 
satisfiable but not fin-satisfiable. 


5.2 Lemma. ®,, is R-enumerable. 


PROOF. First we describe a procedure which decides, for every S,,-sentence 
~ and every n > 1, whether or not @ is satisfiable over a domain with n 
elements. Suppose ¢ and n are given. Since for every structure with n elements 
there is an isomorphic structure with domain {I, ..., ”}, we only need to 
check (by the isomorphism lemma) whether ¢ is satisfiable over {1,..., n}. 
Let S be the (finite!) set of symbols occurring in g and YWy,..., W, be the 
finitely many S-structures with domain {1,..., n} (cf. H.1.5). We can describe 
the YI, explicitly by means of finite tables for the relations, functions and 
constants. @ is satisfiable over {1,...,n} if and only if UW; @ for some 
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i < k. Thus we only need to test whether YW; = @ fori = 0,..., k. These tests 
can be reduced to questions which can be answered from the respective 
tables as follows: Given i < k, if @ = Ty then the problem “YW; = gy?” can 
be reduced to the question of whether W,=w. If @ = (W v 4) then similarly 
the problem can be reduced to the questions of whether %; = Y and whether 
Wt xy. If go = 3xwW we reduce to the questions “YW; Wf1]?”,..., “Up= 
Ww[n]?”. Continuing in this way we eventually arrive at questions of the 
form “Wt W[no,...5%m,-1]?” for atomic formulas Wvo,-..,Um—1) and 
No>--++5m—1 I n. Clearly these can be answered effectively by inspecting the 
tables for YW;. 

Now ®,, can be enumerated as follows: For m= 1, 2,... generate the 
(finitely many) words over A, which are S,,-sentences and are of length <m, 
and use the procedure just described to decide, for n = l,...,m, whether 
they are satisfiable over a domain with n elements. List the sentences for 
which this is the case. O 


5.3 Theorem. ®,, is not R-decidable. 


Proor. For a program P over A =|l|,_ let Wp and wp be defined as in the 
proof of 4.1. We show 


(*) P:O halt iff ppe®,,. 


This proves the theorem; for otherwise, using (*), one could obtain from a 
decision procedure for ®,, a procedure to decide whether P: LJ = halt 
(cf. the corresponding argument in the proof of 4.1). 

Proof d (x). If P: LJ > halt, then 1p is finite and is a model of Wp. Hence 
wp € D,,. Conversely, if P: L) > 2, then by (2)(b) in the proof of 4.1, the 
elements 0”, 1"... are pairwise distinct in every model Y of wp. Thus every 
model of wp is infinite, and hence Wp ¢ ®,,. C 


From 5.2 and 5.3 we now obtain 


5.4 Trahtenbrot's Theorem. The set ®,, ¢ 5S,,-sentences valid in all finite 
structures is not R-enumerable. 


PROOF. Clearly, 
(*) peLs- —®, iff 19 E®,, 


holds for g € L3. For a contradiction assume that ®,, is enumerable. Then, 
using (*), one can also enumerate L3* — ®,,: one simply starts an enumera- 
tion procedure for ®,,, and whenever it lists a sentence “gq, one writes down 
~. This would lead to a decision procedure for ®,, (in contradiction to 5.3) 
as follows: For a string € over A,, decide first whether ¢ is an S,-sentence. 
If so, start enumeration procedures for ®,, (cf. 5.2) and for L3” — ®,,, and 
let both procedures continue until one of them yields ¢ as output. Thus one 
obtains a decision whether ¢ € ®,,. O 
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5.5 Theorem (Incompleteness of Second-Order Logic). The set of valid 
second-order $,,-sentences is not R-enumerable. 


PRooF. Let @;;, be a second-order S$,,-sentence with the property that for 
all 24, 


We go, iff Wis finite 
(cf. [X.1). Then for all first-order S,,-sentences g, 


(*) pe®, iff H Prin > &. 


Now, if the set of valid second-order S,,-sentences is R-enumerable, then 
one can start an enumeration procedure for this set, and each time it yields 
a sentence of the form @,;,, —> @, where y € L3”, one adds ¢ to the list. By 
(*) we obtain in this way an enumeration of ®,,, in contradiction to Trahten- 
brot's theorem. CJ 


Theorem 5.5 is due to Gédel. It is a stronger version of a result obtained 
in IX.1, There we concluded from the failure of the compactness theorem for 
second-order logic #,, that there cannot be any correct and complete proof 
calculus for Y,,. In other words, there is no calculus whose derivability 
relation | satisfies 


(+) For all @,-sentences ¢ and all sets ® of sentences, 
DE giff OFT @. 


However, (+) leaves open the question of whether there is a calculus which 
satisfies (+) for ® = @%, that is, whether there is a correct calculus in which 
all valid second-order sentences are derivable. Now 5.5 shows that in this 
sense second-order logic is also incomplete: If such a calculus existed, one 
could apply its rules systematically to generate all possible derivations and 
hence all valid second-order sentences (cf. the proof of 1.6). 

At this point we see how useful it has been to introduce the notion of 
enumerability: By employing this notion we were relieved of the task of 
giving precise definitions for the notions of derivation rule and calculus, but 
were nevertheless able to conclude that there is no adequate proof calculus 
for the valid second-order sentences. 

The argument for 5.5 above is based on the fact that the finite sets are 
characterizable in second-order logic. Thus it can also be applied to weak 
second-order logic (cf. [X.1.7). 


For the sake of simplicity, we have, in the last section, referred to the 
symbol set S, although we have actually needed only a few symbols from 
S,. It should be clear that the results are also valid for other symbol sets 
S which are effectively given as is S,, and contain the symbols mentioned 
above. One can even show that it is sufficient for S to contain only one 
binary relation symbol. Moreover, the incompleteness of second-order logic 
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already holds for S = @ (cf. 5.6). On the other hand, the set of valid first- 
order S-sentences is decidable provided S contains only unary relation 
symbols (cf. XI.3.9(b)). 


5.6 Exercise. The set of valid second-order @-sentences is not R-enumerable. 


86. Theories and Decidability 


In this section we investigate several theories, especially with regard to 
enumerability and decidability. Among the results obtained is the unde- 
cidability of arithmetic. We shall always assume that the symbol sets con- 
sidered are effectively given. 


A. First-Order Theories 


6.1 Definition. T < L> is said to be a theory if T is satisfiable and if it is 
closed under consequence (i.e., every S-sentence which follows from T 
already belongs to T). 


For every S-structure Y the set 
Th(M) = {9 € LOWE g} 
is a theory, the theory of W (cf. VI.4.1). Th(9) is called (elementary) arithmetic. 
For ® cL} let © :-={peEL3| f= op}. If T is a theory, then T = T, 


and if ® is a satisfiable set of S-sentences, then ®” is a theory. We give a few 
examples. 


(1) @ = {pe Lj | 9}. 

(2) For S = S,,: (first-order) group theory Th,, = ®;,. 

(3) For S = {£}: ZFC set theory Thzp¢:= ZFC". 

(4) For S =S,,: the so-called (first-order) Peano arithmetic Thp, = Dp,. 
The axiom system @®,, consists of the Peano axioms given in IJI.7.5, 
where the usual induction axiom (a second-order sentence) is replaced 
by the first-order "induction axioms" (*) below: 


Vx ax+ 120, VxVWxt+1=eyv+1>x=y), 
Vxx+0=x, Vx¥yx4+G4 D224 y)4+1, 
Yxx-O=0, VxVyx-(vyt+1)=xX-y4+x, 


(*) and for all Xo,...,X,-1, ) and all @eL** such that free(p) ¢ 
{X, ,..., X,—1, ¥} the sentence 


0 »+ 1] 
Nvacees vini((0f A NG > @ +t) = veo), 
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Vt is a model of Dp,. The schema (*) is a natural substitute for the induction 
axiom, because it expresses the induction axiom for properties which are 
definable in first-order logic. Many theorems of elementary arithmetic (i.e., 
sentences in Th(t)) can be derived from ®,,. Nevertheless it turns out that 
not all sentences of Th(‘t) are derivable from ®p,: in 6.10 we shall show that 
®p, S Th). 


6.2 Definition. (a) A theory T is said to be R-axiomatizable if there is an 
R-decidable set ® of sentences such that T = 0". 

(b) A theory T is said to be finitely axiomatizable if there is a finite set ® of 
sentences such that T = O'. 


Every finitely axiomatizable theory can be axiomatized by means of a 
single sentence. (Take the conjunction of the axioms.) Every finitely axio- 
matizable theory is also R-axiomatizable. The theories Thp, and Thzg¢ are 
R-axiomatizable, but it can be shown that they are not finitely axiomatizable. 


6.3 Theorem. An R-axiomatizable theory is R-enumerable. 


ProoF. Let T bea theory and let ® be an R-decidable set of S-sentences such 
that T = 0’. The sentences of T may be listed as follows: Generate syste- 
matically all derivable sequents and check in each case whether the members 
of the antecedent belong to ®, If so, list the succedent provided it is a sentence. 


LO 


An R-axiomatizable theory T need not necessarily be R-decidable. 
Examples are T = @* (for S = S,; cf. 4.1) and T = Th,, (cf. [28]). The 
situation is different, however, if T is complete in the following sense. 


6.4 Definition. A theory T < L$ is complete if for every S-sentence @ we 
have pe Tor Tq@e T. 


Th(2) is complete for every structure YI. 


6.5 Theorem. (a) Every R-axiomatizable and complete theory is R-decidable. 
(b) Every R-enumerable and complete theory is R-decidable. 


PROOF. By 6.3 it is sufficient to prove (b). Let T be an R-enumerable complete 
theory. In order to decide whether a given sentence @ belongs to T, we use a 
procedure to enumerate 7, continuing until either gm or 7@ has been listed. 
Since T is complete, one of these two sentences will eventually be listed. If 
~ is listed, belongs to T; if 7@ is listed, p does not belong to T. LJ 


From 6.5 we obtain the decidability of an axiomatizable theory once we 
have proved its completeness. A method for proving completeness will be 
introduced in the next chapter. In certain cases one can use the assertion in 


WE Or? Mi als Se eaten al 
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6.6 Exercise. Let T = ©" be a theory, where ® is R-enumerable. Show that 
T is R-axiomatizable. (Hint: Starting with an enumeration @o, @;,... of ®, 
consider the set {@9, Po A @1,---}-) 


6.7 Exercise. (a) For at most countable S, let T < L3 be a theory having 
only infinite models. Further, suppose there is an infinite cardinal kK such 
that any two models of T of cardinality k are isomorphic. Show that T 
is complete. 

(b) Set up a decidable system of axioms for the theory of algebraically 
closed fields of fixed characteristic and use (a) to show its completeness 
(and hence by 6.5 its decidability). 


B. The Undecidability of Arithmetic 


In this section we prove the undecidability of arithmetic, i.e., we show that 
there is no procedure which decides for every $,,-sentence whether it holds 
in &. We shall use the same method of proof as in showing the undecidability 
of first-order logic: we effectively assign to every register program P an 
S,,-sentence @p such that 


NE pp iff P: — halt. 


The undecidability of Th(9t) then follows immediately from the undecid- 
ability of [,a- 

In defining wp we shall make use of a formula Wp which, in ‘%, describes 
how the program P operates. The following lemma provides us with such a 
formula. 

Assume the program P consists of the instructions &,..., %,, and let n 
be the smallest number such that all registers mentioned in P are among 
Ro,.--, R,. Recall (cf. §4) that a configuration of P is an (n + 2)-tuple 
(L,Mo,.-.,™,) of natural numbers such that L < k. (L, mp,..., m) stands 
for a situation where «, is the next instruction to be executed and the contents 
of the registers are mo,..., M,.- 


6.8 Lemma. With any given program P one can effectively associate aformula 


Wr(Vo,..+5 Van42) Such that forall ko,...,k, L,mo,...,m, € N thefollowing 
holds: 


Ve Welko,...,k,,L,mo,..., m] iff P, beginning with the con- 
jiguration (0, k,, ...,k), after jinitely many steps reaches the 
conjiguration (L, mg,..., m,). 


Using Wp, we can write down the desired formula @p as 


a 1 
Pp i= dv, +2 PRES Wan42 W p(0, <2 ., 0, k, Unt 2eeees Von+2)- 


n nm 
1Tn case p € LS", for example, we write g(n, »,) for @ — and @(n, m) for @ ——- Here, as before, 
Vo Uo? 


n denotes the corresponding term 1 +... + 1. 
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Then we have (note that « is the halt-instruction of P): 


N= gp iff P, beginning with the configuration (0,...,0), 
after finitely many steps reaches the configuration 
(k, M9,---,m,) for some mMp,..., mM 


iff P: L] > halt. 


3 tt 


Thus we have 


6.9 Theorem (Undecidability of Arithmetic). Arithmetic, i.e., the S,,-fheory 
Th(9), is not R-decidable. LI 


Since Th(%t) is complete, using 6.5, we obtain 


6.10 Corollary. Arithmetic is neither R-axiomatizable nor R-enumerable. L1 


According to 6.9 and 6.10 arithmetic is not amenable to a purely 
"mechanical" treatment in the following sense: There is no procedure for 
deciding whether any given arithmetical sentence is true, nor is there even a 
procedure which lists all true arithmetical sentences. In other words, every 
procedure which lists only true arithmetical sentences must necessarily omit 
some true arithmetical sentences. Thus mathematicians will never possess a 
method for systematically proving all true arithmetical sentences. 


PROOF OF Lemma 6.8. Let P be given as above. We must find an S,,-formula 
Wp(Xo,---3XnsZ Vows +s Vn) (= Wp(X, z, ¥)) which says (in 9) that P, beginning 
with the configuration (0, X), proceeds through a series of configurations, 
ending finally with the configuration (z, y). That is, Wp(X, z, y) should be a 
formalization of the following statement: 


"There is an se N and a sequence Cy,..., C, of configurations 
such that 


(1) Co = 0, X0,--65 Xn) Cs = (2 Yon +s Yad 
and for all i< s, C; = Can 


“C; = Ci41” means that P passes from configuration C; to C;4, When 
executing the instruction addressed in C;. We form a single sequence from 
Co,.--,C, and thus obtain the following formulation of (1): 


"There is an s € N and a sequence 


(ao, +265 4n41> n+ 25> »9 int 2)4+(n+1)>° 2 Asn 2jrerrs Os.in42)+(n+1)) 
me mm re” —_—_—_—_———————— ee” 


Co C, C, 
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(2) such that 
Ay = 0,4, = Xo,---5 Ang = Xn; 


Asin+ 2) = 2, Asn gs 2)+1 = Vor-+ +> Asqn+2)+(n+1) = Vans 


and for alli< s, 


” 
(Gi.in+2)> »+ +5 Ging 2)4(n4 1)) ? (Qa+ 1)-(n+2)9 +9 Att) (nt 2)4(n4 1)) . 


The principal difficulty in formalizing (2) as a first-order S,,-sentence 
arises with the quantifier "there exists a sequence...”. We overcome this 
problem by using natural numbers as codes for finite sequences. Often one 
codes a sequence (a,, ...,a,) by the number p@?*?..... pa'*! where p; 
denotes the ith prime. However, when using this code, we would be forced to 
give an L**:-definition of exponentiation. Since such a definition is rather 
involved, we provide another coding where a sequence (a,, ...,a,) is coded 
by two suitably chosen numbers t and p. 


6.11 f-function Lemma.’ There is afunction 6: N* > N with the following 
properties : 


(a) For every sequence (a,, ...,a,) over N, there exist t, p € N such thatfor 
i<r 
B(t, p, i) = a,j. 
(b) There exists an S,,-formula y(v9, 01,02, v,) which defines B in Yt in the 
sense that jor t, pp ,a EN, 
NE vit pia) itt BU p, i) =a. 


PROOF. (a) Given (a,, ...,a,), we choose a prime p which is larger than 
a a r+ 1 and set 


(x)  te=1-p° + agp! + 2p? + a,;p? +---+(r + Lp + a,p?"*". 
By choice of p the right-hand side is the p-adic representation oft. 


First we show that for alli <r 


a=a, iff there are by, b,, b, such that 
(i) t= bo tb, + 1) tap t+ b2p*), 
(**) (ii) a <p, 
(ili) by < by, 
(iv) by =p” 
The implication from left to right follows immediately from (*). Conversely, 
suppose (i)~(iv) hold for bp, by, b2 and let b, = p?'. From (i) we obtain 


t=bo $a +1) p+ ap?) + bop*'*?. 


for a suitable /. 


? This nomenclature stems from Gédel’s use of Bf for a function with the properties (a) and (b) 
of the lemma. 
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Since by < p?!, a < p, and the p-adic representation of t is unique, a com- 
parison with (*) yields! = ianda = a;. Weset f(t, p, i) = a;,1.-e., the uniquely 
determined number a for which the right-hand side of (**) holds. We extend 
this definition to arbitrary natural numbers r, q, j by specifying 


the smallest a such that there exist by, b,, b. with 
(i) r= bo t+ biG + 1) + aq + b2q°) 
(ii) a<q 
Br, 4.) =4 (iii) by < b, 
(iv) b, = q?' for suitable /, 
if such an a exists and q is prime 
QO, otherwise. 


Then f has the properties required in (a). 

(b) The definition just given leads immediately to an S,,-formula 
¥(vo, U4, V2, V,) defining f; one need only note that (iv) is equivalent to the 
condition that b, be a square and that for alld # | withd|b,; wehaveq/ld. LU 


We now return to the proof of 6.8, that is, to the problem of giving an 
S,,-formula which says that the program P passes in finitely many steps from 
the configuration (0, X) to the configuration (z, y). As we have seen, this 
statement about P is equivalent to statement (2). We can formalize (2) with 
the aid of the formula y from the /?-function lemma (where we now use 
s,t,...to denote variables): 


W(X, ---1Xns Zs Yoo++ +s Yn) 

i= ds dp Je(x(t, p, 0,0) a x(t p, 1, Xo) A... A M(t D+ 1, xy) 

A X(t, p, 8° (n+ 2), Zz) a Xt, p,S*M+2) + Ly) A... 
A X(t, p, S-(n_+ 2) + (n+ 1), Vn) 

A ViVuug...u, Vuug... uj, 

li<sa x(t, pi.(n +2) u) a x(t, p.i.(2#+.2) $ luo) a... 

A X(t, p, i. (a+ 2) + (n+ 1), 4,) 

A x(t, pa + 1).(44.2),u) a x(t, p,d +1). (2 +.2) + Luo) a... 
At, p» i + I) (a t_2) + (n_ +1), uy) 

= (U,Uo ss aig UL) Sp CW, Hoses Uy) D)- 


Here 
“(U, Ug, +++ Un) se (UY, Uo, ++, Un)” 


stands for a formula which describes the direct transition from configuration 
(U, %,...,U,) to configuration (u', ug,..., 4); such a formula can be 
obtained as a conjunction Wo A... A W,—1, where W; describes transitions 
induced by instruction «,; of P. For example, if «; is of the form 


j LET R, =R, +1 
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then we take 
WeeusfoW=utlauw =u rau Seu, +tlauw=ma--- 
A u, = U,). 
Instructions of other type can be treated similarly. Thus a formula ¥, with 


the desired properties is obtained, and the proof of 6.8 is completed. LK 


Finally, we note another consequence of the fact that computations of 
register machines can be described in ‘ft. 


6.12 Theorem. Let r > 1. 


(a) Given an r-ary R-decidable relation 2 over N, there is an S,,-formula 
P(¥o,.+.+,0,—4) such that for all kg,...,k,_5 € N, 


DQko... kay iff RE plko,---.kr-1). 
(b) Given an R-computable function f: N' > N, there is an S,,-formula 
(Vo, «+ +5 Vp 1s Ur) SuCh that for all ky,...,k,-1,k, EN, 
f r eet f iff NR p{Ko.- +s K-15 ky), 
and in particular, 
Nt aad che v, P(ko, fae Kpais U,). 
PROOF. (a) Suppose r = | and let & be an r-ary R-decidable relation over 
N. Let P bea register program which decides Q and %,,,..., %,, be the print- 
instructions of P. Suppose that R, is the largest register mentioned in P, 


and without loss of generality, that n > r — I. Then, using Wp from 6.8, we 
have for arbitrary ko,...,k,-,¢ N: 


Qko...k,-1 iff P, beginning with configuration 
(0, kos -+5 ky—1,0,--+5 9), 
ee een 
n—r-4+l 


after finitely many steps reaches a configuration 
of the form (L,,0,m,,...,m,) withO 7 i J! 
iff Jt dy43.-. Vaya 


(Wr(Ko, gaigehegeg Oh 05 Oy dogs 0, Unease is Dowad) 
Vena Wak Kos ids Regs Os se Oe by 20: Una3o+« +s VIn+2))- 


Thus for @(tg,...,U,—-1) one can take the formula 


JU,43 -++U2n+2 V Wrlvo,-- +5 Ura, Ons, L;, 9, Unt3o+++> V2n+2): 
i=0 
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(b) We proceed as in (a), noting that 
f(Ko,---5k--1) =, iff P, beginning with configuration 
(Oy Reece hie 0); 


after finitely many steps reaches a con- 
figuration of the form (L,, k,, m,,..., m,) 
with O 7 iT 1. 


Hence the required formula @(v9,...,0,-1, ¥,) can be chosen as 


l 
Jn +3 +++ Van+2 \/ Wr(Uo,---» Up —~15 O522330, Li, v,, Deg ascats Vinee) O 
i=0 


Relations and functions over N which can be described by an S,,-formula 
as in 6.12 are said to be arithmetical. Thus 6.12 says that all R-decidable 
relations and all R-computable functions over N are arithmetical. 


$7. Self-Referential Statements and Gédel’s 
Incompleteness Theorems 


In the preceding section we have shown that arithmetic is not R-axiomatiz- 
able. Originally G6del [13] used another method to prove this result. He 
showed that within sufficiently strong axiom systems there are self-referential 
formulas, i.e., formulas which make statements about themselves. Such 
self-referential formulas are the main theme of this section. We shall close 
by taking up our original objective of this chapter and obtain some important 
results concerning the limitations of the formal method. With this aim in 
mind we shall often conduct the arguments on the syntactic level. 
In the following we take ® to be a set of S,,-sentences. 


7.1 Definition.(a) A relation {2 < N" is said to be representable in ® if there 
is a formula @(vo, ..., ¥,-;) € L>** such that for all ng,...,n,-5€ N: 


Qing... ny— 1 implies ® t- P(No, -- +5 Mp1), 
not Qing...n,-, implies TO@(Ho,...,,—1). 


In this case we say that @(v9,..., 0,-,)} represents 2 in ®. 
(b) A function F: N’ > N is said to be representable in ® if there is a formula 
(vo9,...,v,) € L*, such that for all mo,...,",-1,n, EN, 


F(fo,...,M,-1) =A, implies PF —lno,...,H,); 
F(no,-.+,"%,-1} AN, implies DE T7PHy,..., 4,)3 
a, = a v,P(No, ag HAG Vy pe 


In this case we say that g(vy9,..., v,) represents F in ®. 
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7.2 Lemma. (a) [f® < ®' < L3* then the relations and functions representable 
in ® are also representable in ©", 

(b) If D is inconsistent then every relation and every function is representable 
in ®, 

(c) Let ® be consistent. If ® is decidable then every relation representable in 
® is decidable and every junction representable in ® is computable. 


All assertions follow directly from definition 7.1. To prove (c), note that 
the set of formulas derivable from a decidable set ® is enumerable. LJ 


For ® c L3* we define 


Repr ® iff all R-decidable relations and all R-computable 
functions on N are representable in ®. 


Repr ® says, in a certain sense, that ® is rich enough to describe how pro- 
cedures operate. In the preceding section we have described the execution 
of programs in ® := Th(3t). Indeed we have 


7.3 Proposition. Repr Th(?). 
The proof is immediate from 6.12 if one notes that for every S,,-sentence @, 


NE gp iff THY 
and 


not NE @p iff Th) 4g. O 


A closer analysis shows that the arguments in the previous section which 
led to a description in Th({3t) of the execution of programs, can actually be 
carried out in ®,,. Thus one can obtain 


7.4 Proposition. Repr ®p,. O 


As an important technical means we assume in the following that an 
effective coding of the S,,-formulas by natural numbers (a Godel numbering) 
is given. and moreover that every number is the G6dde] number of some 
formula. We write n® for the Godel number of @. Conversely, we let @, 
denote the formula with Gddel number n. In this way it is possible to translate 
statements about formulas into arithmetical statements. For example, a 
statement about the derivability of a formula @ becomes an arithmetical 
statement about the Géddel number of g, and this in turn can be formalized 
as an S,,-sentence, 

The way we shall proceed originates from the liar's paradox, thereby 
leading, on a formal level, to a clarification of the problems which lie behind 
this paradox. The paradox of the liar amounts to the fact that the statement 


(*) "Tam not telling the truth" 
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can neither be true nor false; for if it were true it would have to be false, and 
if it were false it would have to be true. Note that (*) makes a statement 
about itself, and hence is an example of a self-referential statement. 

As a first step we consider statements of this kind in general. We show 
that within a sufficiently rich system, every property which is expressible in 
the system gives rise to a self-referential sentence; more precisely: 


7.5 Fixed Point Theorem. Suppose Repr ®. Then, for every w € L}", there 
is an S,,-Sentence ~ (= @y) such that 


Dr pon). 
Intuitively, p says" I have the property ¥". 


Proor. Let F: N x N > N be given by 


xm) 
nv, ifn =n’ for some x € Li 


ee 10. otherwise. 
Clearly, F is computable, and for x € L}** we have 
F(n*®,m) = n*™, 
Since Repr ®, F can be represented in ® by a suitable formula a(v9, 01, 02) € 
LS", We write x, y, z for v9, v,, v,. For given w € L}** we set 


B = Vz(a(x, x, z) > W(z)), 


p= V2(a(n* n® 2) > W(2)). 
B 
Since Be L3** and g = p we have F(n?, n’) = n€ and hence 


(1) DE a(n’, n°, n°). 
We now show in two steps that 
Dr po pn’). 


First, by definition of ¢, 
Dv {yp} a(n’ n? n?) > Wn). 


By (1), it follows that + @ — Wn). Since « represents the function F in 
@, we have, on the other hand, 


DE Vtza(n’, n', 2); 
thus by (1) 
Dt Wz(a(n’, n’,z)—> z = n°), 
and therefore = 
DE YWn®) — V2(a(n", n*, z) > WZ), 
that is, 
Dr Wn*) = 9. 
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Let us now turn to systems which are rich enough to contain statements 
about the "truth" or "falsity" of statements formalizable in the system. The 
following theorem shows that in such a system one cannot classify all state- 
ments as either "true" or "false". Formally, we consider an axiom system @; 
The "true" statements correspond to the sentences in 


®' = {pe L3"|OF 9g}, 


the "false" statements to the negations of sentences in ®". To say that one 
can speak of "truth" in ® is to say that ®' is representable in ® (more 
precisely, that {n®|p ¢®"} is representable in ®)., If ® satisfies this latter 
condition then, as we shall show, there is a sentence @ such that neither » 
nor @ belongs to ®". 


7.6 Lemma. Let ® be consistent and suppose Repr ®, If ®" is representable 
in ®, then there is an S,,-sentence ~ such that neither D + @ nor ®t N@. 


PROOF. Suppose x(vg) € L¥** represents the set ® in ®. Then in particular, 
for ae Le, 


DE y(n") iff DE a, 
For yw = 7 y we choose, by 7.5, a "fixed point” @ € L3** such that 
(*) Or pe 1yn"); 
@ says intuitively "I am not true”. 

As in the paradox of the liar, we now obtain that neither P+ q@ nor 
®t “9. For if PE @g then D+ y(n®) and hence by (*), DE “4, that is, 
® is inconsistent, contrary to our initial assumption. On the other hand, if 


®t “@ then by (*), ®t y(n?) and therefore ® t+ ~, and ® would again be 
inconsistent. L 


Lemma 7.6 has interesting consequences both on the syntactical and 
semantical levels. In semantical formulations one usually refers to ®* 
instead of ®', 


7.7 Tarski’s Theorem [26]. (a) Suppose ® is consistent and Repr ® holds. If 
®" is representable in ®, then ®* is not complete. 
(b) Th(9) is not representable in Th(%). 


PROOF. (a) Since ®* = ®*, (a) follows immediately from 7.6. 
(b) follows from (a) by setting ® = Th(9) and noting that Th(9t) = 
Th(M)™ is complete. L] 


Tarski’s theorem is of great significance in the study of semantics. Part 
(a) says that for a sufficiently strong system the following two conditions 
cannot hold simultaneously: 


(1) Every statement in the system is either true or false (“®*™ is complete"). 
(2) Truth is expressible in the system (“®* is representable in ©”), 
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The paradox of the liar arises from the tacit assumption that both conditions 
(1) and (2) hold for everyday language. 

Part (b) of Tarski’s theorem can be formulated succinctly as "there is no 
truth definition for arithmetic within arithmetic”. 

Like Tarski’s theorem, Gédel’s first incompleteness theorem is also a 
consequence of lemma 7.6 (cf. 6.10). 


7.8 Gédel’s First Incompleteness Theorem. Let © be consistent and R-decidable 
and suppose Repr ®. Then there is an S,,-sentence ~ such that neither ® | @ 
nor Dt T@. 


Proor. Suppose that for every S,,-sentence g, either Dt g or DE 7g. 
Then ®" is decidable. Hence, by Repr ®, ®” is representable in ® in con- 
tradiction to 7.6. LJ 


A refinement of the above discussion leads to results concerning the 
consistency of mathematics. In particular, G6del’s second incompleteness 
theorem, which we shall derive, shows that the consistency of a sufficiently 
rich system cannot be proved using only the means available within the 
system. 

Let ® c Lé** be decidable such that Repr ®. We choose an effective 
enumeration of all derivations in the sequent calculus associated with S,, 
and define a relation H c N x N by 


Hnm_ iff the mth derivation ends with a sequent of the form 


Wo -.-Wre-1@na, Where Wo,...,We-1€@ and @, is 
(as before) the nth formula in the Gédel numbering 


of S,,-formulas. 
Since ® is decidable, so also is H, and clearly, 
M+ g_ iff there is me N such that Hn°m. 


As a decidable relation, H can be represented in ® by a suitable formula 
Qu(vo, 01) € L3**. Again we write x, y for vg, v, and set 


Dero(x) = Iypuy(x, y). 
For ¥ = Derg(x) we choose a fixed point p € L3": 
(*) ® t- po 7Derg(n?®). 
@ says intuitively "I am not provable from ®” 
Then we have 
7.9. If ® is consistent then not D- @, 


PROOF. Suppose ® + g holds.Choose msuch that HnYm. Then ®+ @,(n®, m), 
and so M+ Derg(n®). From (*) we have Dt “@, and hence ® is incon- 
sistent. LJ 


87. Self-Referential Statements and Gédel’s Incompleteness Theorems 177 


Since Repr ®, the consistency of ® is equivalent to "not D} 0= 1”. 
(Note that in case ®+ 0 = 1 every representable set which contains 0 
would also contain 1.) The S,,-sentence 


Consisg := —Derg(n® *') 


thus expresses the consistency of ®. 7.9 may then be formalized as the 
S,,-sentence 


Consisg > 7 Derg(n®). 


An argument which is in principle simple, though technically rather tedious 
could now be used to show that the proof of 7.9 can be carried out on the 
basis of ®, in case ® > Mp, (and a natural representation My(vo, v,) of H has 
been chosen). This means: 


(**) D+ Consisg > 7 Derg(n?). 


We then can deduce 


7.10 Gédel’s Second Incompleteness Theorem. Let ® be consistent and 
R-decidable such that Ppy < ®. Then 


not D | Consisg. 


ProoF. If ® + Consisg then by (**) ® 7 Derg(n®) also. By (*), BL po 
1 Derg(“”) and hence P+ 4g, in contradiction to 7.9. LJ 


For ® = ®,, Gédel’s second incompleteness theorem says intuitively 
that the consistency of ®p, cannot be proved on the basis of Pp,. This result 
shows that Hilbert's program cannot be carried out in its original form. In 
particular this program aimed at a consistency proof for Pp, using only 
finitistic means. The concept "finitistic", though not defined precisely 
(cf. [18], p. 32), was taken in a very narrow sense; in particular it was required 
that finitistic proof methods be carried out on the basis of Dp,. 

The above argument can be transferred to other systems where there is a 
substitute for the natural numbers and where decidable relations and com- 
putable functions are representable. In particular, they apply to systems of 
axioms for set theory such as ZFC. One uses the natural numbers as defined 
in ZFC. Then one can define an {e}-sentence Consiszp¢, which expresses 
the consistency of ZFC, to obtain 


7.11 Theorem. If ZFC is consistent then not ZFC + Consiszgc. LC 


Since present-day mathematics can be based on the ZFC axioms, and 
since "not ZFC} Consisz¢c” says that the consistency of ZFC cannot be 
proved using only means available within ZFC, we can formulate 7.11 as 
follows: If mathematics is consistent, we cannot prove its consistency by 
mathematical means. 
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In a similar way also Tarski’s theorem and Gédel’s first incompleteness 
theorem can be transferred to axiom systems for set theory. For example, 
7.8 would then assert that for every decidable and consistent system of 
axioms ® for set theory which contains ZFC, there is an (5)-sentence y such 
that neither P+ yw nor ®+ —wy. Intuitively this means that there is no 
decidable consistent system of axioms for mathematics which, for every 
mathematical statement, allows us to either prove it or disprove it. In this 
fact an inherent limitation of the axiomatic method is manifested. 


CHAPTER XI 


An Algebraic Characterization of 
Elementary Equivalence 


The greater part of our exposition so far has been devoted to the development 
and investigation of first-order logic. We can justify the dominant réle 
assumed by first-order logic in several ways: 


(a) First-order logic is in principle sufficient for mathematics. 

(b) The intuitive concept of proof and the consequence relation can be 
adequately described by a formal notion of proof, which is given by 
means of a calculus. 

(c) A number of semantic results such as the compactness theorem or the 
Lowenheim-Skolem theorem lead to an enrichment of mathematical 
methods. 


However, in contrast to these positive aspects, one also has to take into 
account that the limited expressive power of first-order language often 
requires clumsy formulations. In particular, it forces us to make explicit 
reference to set theory to an extent not usual in mathematical practice. For 
this reason we were led to seek other systems with greater expressive power 
but still satisfying conditions (b) and (c). We introduced a number of exten- 
sions of first-order logic (4), 27, “0,0,% 9) and investigated their 
semantic properties. In each case we found (cf. [X) that not all the properties 
mentioned in (c) are available. 

In the previous chapter we obtained negative results of a more syntactic 
nature. For example, we saw that for #, and for #y there is no possibility 
of adequately describing the notion of proof by means of a calculus; hence 
in these cases we also have to make concessions concerning (b). 

The discussion in the last two chapters will show that these negative 
results have a deeper reason: Having made precise the concept "logical 
system" we shall prove in Chapter XII that no logical system with more 
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expressive power than first-order logic can meet the conditions of (b) and 
(c). 

In the present chapter we introduce a useful tool for these investigations. 
Recall that two structures are elementarily equivalent if they satisfy the 
same first-order sentences. We now present a purely algebraic characteriza- 
tion of elementary equivalence. This characterization is useful not only for 
our present purpose but also in other contexts. For example, in many cases 
it can serve to verify that two given structures Yl and 23 are elementarily 
equivalent (in a simpler way than by proving directly that YW and 2 satisfy 
the same first-order sentences). 


1 Partial Isomorphisms 


In this section we provide the concepts we need in order to formulate the 
algebraic characterization of elementary equivalence. We refer to a fixed 
symbol set S. The domain of a map p is denoted by dom(p), its range by 
rg(p). 


1.1 Definition. Let YI and 23 be (S-)structures and let p be a map. p is said to 
be a partial isomorphismfrom Y to 2 if and only if dom(p) < A, rg(p) < B 
and p has the following properties: 


(a) pis injective. 
(b) pis homomorphic in the following sense: 
(1) For n-ary Pe S anda,, ...,a,-, € dom(p), 


P™a)...a 41 iff P®p(ay)... Man-1). 
(2) For n-aryf €S anda,, ...,a,-1,a € dom(p), 
FMAo,...,%-1) =a iff S?(p(ao),..., P@n—1)) = PQ). 
(3) Force €S and a€dom(p), 
ci=a_ iff c® = p(a). 


We write Part(%, 8) for the set of partial isomorphisms from I to 8B. 


1.2 Examples and Comments. 


(a) The empty map, 1¢., the map with empty domain, is a partial iso- 
morphism from YF to 23. 


(b) The map p with dom(p) = {2, 3) and p(2) = 2, p(3) = 6 is a partial 
isomorphism from the additive group (R, +, 0) of real numbers to the 
additive group (Z, +, 0) of integers. On the other hand, the map gq with 
dom(q) = {2, 3) and q(2) = 1, q(3) = 2 is not a partial isomorphism from 
(R, +, 0) to (Z, +, 0), because 2 + 2 # 3 but g(2) + G2) = q(3). 


$1. Partial Isomorphisms 181 


(c) If S is relational, that is, if S contains only relation symbols, then for 
d,, ...,4-,€A and Bb, ...,5,-,; € B the following two statements are 
equivalent: 


(*) By setting 
p(a;) = 5; (i<r), 


a partial isomorphism from Y to B is determined (where dom(p) = 
(do, oom a,—1} and rg(p) = {bo, ar) b,- t}). 


(x*) For every atomic formula we L’, 


WE Wlap,.-.,4,-1] iff BE Wlbo...., 5-1]. 
PROOF. First we note that for i,j < r 


a) aj=a; iff We v, = vjlao,...,4,-1], 
b= b; iff Be v, = vfbo,...,b,-11, 
and that for n-ary Pe S andi,, ...,i,-; <r 
Q) Pets 3, Ti MUP ot sli ati O27) 
Pbk Dic Aft BS Pie cc Oi. esd Dea): 


Now, if («*) holds, then by (1) and the fact that 
We v, = %l40....,a,_,] iff Bev, = Desa gOpsy ls 
p (as given in (*)) is well-defined and injective. Since 
WS Piette fags ciey Opa). “ih Be Pip eae (Doster Opals 


and by (2), p is also homomorphic. 
Similarly one can use (1) and (2)to deduce (**) from (*). LO 


(d) Note that the equivalence in (c) is no longer true if S contains function 
symbols or constants. For example, for the partial isomorphism p in (b) 
not (R, a, 0) - Vo Fe (vo 1 Uo) = v,[2, 3], 
but on the other hand 


(Z, +, 0) Uo + (Yo + Uo) = vilp(2), p3)I- 


(e) The following example shows that even for relational S a partial 
isomorphism does not in general preserve the validity of formulas with 
quantifiers. 

Let S = {<} and let gy be the partial isomorphism from (R, <)to(Z, <) 
such that dom(qg,) = {2, 3} and go(2) = 3, qo(3) = 4. Then 


(R, <) -E Jv2(v9 <b2 Av2< v1) 2: 3] 
but 
not (Z, <) — dv2(¥9 < v2 A v2 < Ly) Lgo(2), Go(3)]. 
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If pis a partial isomorphism from(R, <)to(Z, <)such that dom(p) = {a, b) 
and a < b, then we always have 


(R, <) FF Av(v9 < U2 A002 < v;) La, bl, 


since, for example, 


. b 
(R, <)& 09 < 02 A 02 < ofa, | 


In this case the validity of 
(+) (Z, <) & Jv2(v9 < v2 A v2 < v;)L PC), p()] 


is equivalent to the existence of a partial isomorphism q from (R, <) to 
(Z, <) which extends p and has (a + b)/2 in its domain. For, if such a q 
exists, then (+) holds, since 


a+b 


(Z, <)& 9 < 02 A V2 < vole, q(b), (| 


conversely, if (+) is satisfied and, say, 
(Z, <)E U9 < v2 A v2 < v;[ pla), p(b), d], 


then the extension g ofp with dom(q) = {a, b, (a + b)/2} and q((a + b)/2) =d 
is such a partial isomorphism. 

This argument indicates that the truth of formulas with quantifiers is 
preserved under partial isomorphisms provided that these admit certain 
extensions. It embodies the basic idea behind the algebraic characterization 
of elementary equivalence: The elementary equivalence of structures amounts 
to the existence of extensions of certain partial isomorphisms. 

In the following we identify a map p with its graph {(a, p(a))|a € dom(p)}. 
Then p < q means that q is an extension of p. 


1.3 Definition. 8 and 23 are said to be finitely isomorphic, written WU = , 23 
iff there is a sequence (I), with the following properties: 


(a) Every I, is a nonempty set of partial isomorphisms from YW to B. 

(b) (Forth-property) For every pel, , and a€A there is q€ I, such that 
q > pand ae dom(q). 

(c) (Back-property) For every pél,, , and be B there is qe I, such that 
q > pand be rg(q). 


Informally we can express (b) and (c) as follows: partial isomorphisms in 
I,+, can be extended (n + 1) times; the corresponding extensions lie in 
I,, 1,-1.---,1,, and Ig, respectively. If (I,),-n has the properties (a), (b), 
and (c), we write (I,),.,: =, B. 


1.4 Definition. 2{ and 8B are said to be partially isomorphic, written W =, 23, 
iff there 1s a set J such that 
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(a) Lis a nonempty set of partial isomorphisms from YW to 23. 

(b) (Forth-property) For every pé€I and a€A there is qéI such that 
q > pand a € dom(q). 

(c) (Back-property) For every pel and b€B there is qeI such that 
q > pand be rg(q). 


Thus the conditions (a), (b), and (c) amount to (J),,,: YW =, 23 for the 
constant sequence (I),, y. 

EF (a), (b), and (c) are satisfied for I we write I: WU =, 8B. 

The following lemma lists the relations between the various notions of 
isomorphism. 


1.5 Lemma. (a) /F UW = B, then W =, 23. 

(b) If W=,BthenU =, B. 

(c) HW => Band A is finite then W = 23, 

(d) If YW =, Band A and B are at most countable then W = 23. 


Proor. (a) If 7: W = B then 1: W=,¥B for I = {zx}. 
(b) If: U =, Bthen (),.y: UW =, B. 


(c) Suppose (1,),,,: % 2,23, and suppose A has exactly r elements, 
A = {a,, ...,@,-1}. We choose pé/,,,. If we suitably apply the forth- 
property r times we obtain a qé€I, such that a,, ...,a,., € dom(q), ie., 


dom(q) = A. If rg(q) # B and be B — rg(q), then by the back-property 
there would be an extension q' ofqin I, such that b € rg(q’). Since dom(q) = A, 
this is not possible. Therefore rg(g) = B and thusq: WU = B. 

(d) Suppose I: W=,B, A = {a, a,,...$ and B = {bo, by,...). Starting 
from an arbitrary po € I, by repeated application of the back- and forth- 
properties, we obtain extensions p,, p2,... in I such that a, € dom(p,), 
by € rg(p2), a, € dom(p3), by € rg(p4),..., that is 


(1) Pac Pn+t> 
(2) if nis odd, say n = 2r + 1, then a, € dom(p,); 
(3) if nis even, say n = 2r + 2, then b, € rg(p,). 


By (1), p:= nen PB is a partial isomorphism from YW to B. As dom(p) = A 
(by (2)) and rg(p) = B (by (3)), we have p: 2 = 23. haa 


Part (d) of 1.5 is an abstract version of the following theorem of Cantor: 


1.6 Theorem. Any two countable dense orderings (without endpoints) are 
isomorphic. 


Here a dense ordering is a { < }-structure which is a model of ®,,,g, where 
®,,,4 contains the ordering axioms together with the following sentences: 
Wx Vy(x < y > dz(x <2zAz<_y)), 
Vx dyx <y, Vx ayy x: 
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(R, <) and (Q, <) are dense orderings. By contrast, (7, <) is not a dense 
ordering. 
Cantor's theorem follows from 1|.5(d) and 


1.7 Lemma. /f' 2= (A, <*) and 8 = (B, <®) are dense orderings, then 
I: WM =, Bforl = (pipe Part(%, B), dom(p) finite}. 


PROOF. Since p = @ is in I, I is not empty. I satisfies the forth-property. For, 
if pEI, dom(p) = {a,, ...,a,-,}, and a E A, then because 23 is dense there 
is an element bE B which is related to p(dg),..., p(a,—;,) in the ordering 23 
in the same manner as a is related to a,, ...,4,— 1 in the ordering YW. Then 
g :=p u {(a, b)) is an extension of p which is defined for a and lies in I. The 
back-property follows analogously, using the fact that Y is dense. = 


1.8 Example. Suppose S = {¢,0} and let ®, consist of the “successor 
axioms" 


¥x(7x = 0 dyay = x), 
Vx Vax = ay>x = y), 
and for every m = I: 
Yx 10...0X =x. 
nae aoe 
m-times 


The structure 9t, (cf. III.7.3(2)) isa model of @,. We show that any two models 
of ®, are finitely isomorphic. For a model Y of ®, and for a E A, let 


2 ae au) 
ee 
mi-times 
For every 1 E N we define a "distance function" d, on A x A by 
m if a” = a' and m < 2", 
d{a,@):=y—m ifa” =aand m I 2", 
Be otherwise. 


Now suppose Y! and 23 are models of ®,. We show that (I,),,,.  & = >¢ 23, 
where 
I, :={pe Part(%, B)|dom(p) finite, 0* € dom(p), 
and for all a, a'€ dom(p), d,,{a, a') = d,(p(a), p(a’))}. 


Thus a partial isomorphism in I, preserves the “d,-distances”. First we have 
I, 4 @ since {(O*. 0®)} EI,. We sketch a proof of the forth-property for 
(I,),,, (the back-property can be proved analogously). Suppose pe J,., , 
and uk A. We distinguish two cases, depending on whether or not the 
following condition (*) is satisfied. 


(*) There is an a' Edom(p) such that |d,(a’, a)| <2” 
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If (*) holds there is exactly one b € B for which p u {(a, b)) preserves the 
d,-distance (since p€1,,.,); ff (*) does not hold we choose an arbitrary 
element b such that d,{ p(a’), b) = x for all a'€ dom(p) (such an element 
must exist since every model of O, is infinite). In any case it 1s easy to show 
that q:=p u {(a, b)} EL. L 


1.9 Exercise. LetS = @. Show that any two infinite S-structures are partially 
isomorphic. 


1.10 Exercise. (a) Give an example of structures which are partially iso- 
morphic but not isomorphic. 

(b) Give an example of structures which are finitely isomorphic but not 
partially isomorphic. 


1.11 Exercise. Give an uncountable model of the system of axioms O, in 1.8. 


§2. Fraisse's Theorem 


Using the concepts introduced in $1, we now formulate the main result of 
this chapter. 


2.1 Fraissé’s Theorem. Let S be a finite symbol set and YW, 23 S-structures. 
Then 


Note that Fraisse's theorem provides us with a characterization of 
elementary equivalence which does not refer to first-order language. 

Before proving the theorem (in the next section) we give several examples 
showing how it can be used to check the elementary equivalence of structures 
and the completeness of theories. 


2.2 Proposition. (a) Any two dense orderings are elementarily equivalent. In 
particular, (R, <) = (Q, <). 

(b) Any two {a,0)-structures satisfying the axioms in 1.8 are elementarily 
equivalent. 


PROOF. (a) follows from 2.1, since every two dense orderings are partially 
isomorphic, and thus also finitely isomorphic; (b) follows analogously by 
means of 1.8. 


For some applications we need the following simple criterion for the 
completeness of theories. 
2.3 Lemma. Fora theory T < L@ the following are equivalent: 


(a) T is complete, i.e.,for every S-sentence ¢ either g € T or T@eE T. 
(b) Any two models of T are elementarily equivalent. 
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Proor. Suppose first that (a) holds, and let Y, 23 be models of T. For any 
S-sentence g either pe T or T@eET. If pe T then WE mand BE Qg; if 
79 €T thenYW— ogand BE nq. Thus UE pif BE @. 

Conversely, let @ be an S-sentence and suppose @ ¢ T. Since T is a theory, 
TE @ does not hold, and therefore there is a model WY of T U {79}. By (b) 
every model of T is elementarily equivalent to Qf, and thus is a model of “@. 
Hence T = 7@ and therefore “9 € T. LJ 


From 2.2, with the aid of 2.3 and X.6.5 we obtain 


2.4 Proposition. (a) The theory ®j,,4 of dense orderings is complete and 
R-decidable. (Thus,for example, ®3,,4 = Th(R, <).) 

(b) The theory ®F of "successor structures" is complete and R-decidable. 
(Thus for example, ®F = Th(N, 9.) 


Preparatory to the proof of Fraisse's theorem we show that we can restrict 
ourselves to relational symbol sets. 

Let S be an arbitrary symbol set. As on p. 120 we choose, for each n-ary 
f €S, a new (n + 1)-ary relation symbol F and, for each c € S, a new unary 
relation symbol C. Let S* consist of the relation symbols from S together 
with the new relation symbols. S’ is relational. For an S-structure Q, let 2’ 
be the S"-structure obtained from YW, replacing functions and constants by 
their graphs (as in VIII.1). 

When defining partial isomorphisms we treated functions and constants 
in such a way (cf. 1.1(b)) that 


Part(W, B) = Part(Q’, B’). 
From this we obtain 
(*) W=,B iff w~,B 
In VIII.1.7 we showed that 
(**) W= 8 iff Ww = BR" 


Thus, in proving Fraissé’s theorem, we can restrict to relational symbol sets. 
For, if Yf and 8 are given, it follows from 


W= Br iff w=, Be 
by (*) and (**) that 


2.5 Exercise. Show that forS = @ the theory of infinite sets, {g,,,|n = 2}", 
is complete and R-decidable. 
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2.6 Exercise. Let S = {P,]n € N} bea set of unary relation symbols. Define 
the S-structures 21 and ® as follows: A:=N,B:= Nu {x} P™:={m|meN, 
m > n},P3:={m|meN,m > n} uw {co}. Show that A = B but not A = , B. 
(Thus Fraisse's theorem is in general not true for infinite symbol sets. Note, 
on the other hand, that for arbitrary S and S-structures YW, 23 we have YW = B 
iff for any finite Sp < S, W [Sp = 2B [ So, that is. WP Sp =~ B PS.) 


§3. Proof of Fraisse's Theorem 


As a measure of the complexity of formulas we define the quantifier rank of a 
formula @ to be the maximum number of nested quantifiers occurring in it: 


qr(g) :—0, if @ is atomic; 
qr) :=qr(g); —gr(p v w) = max{qr(~), qr(W)}; 
qr(ax@) = qr(g) + I. 


For example, the formula 74x(Vy Rxz a Qy) a VzQz has quantifier rank 2. 
The formulas of quantifier rank zero are the quantifier-free formulas. 

In the sequel let S be a fixed finite relational symbol set. One half of 
Fraissé’s theorem amounts to 


3.1. /fU =, B then U= B. 


In order to prove 3.1 we must show for every S-sentence ¢ that 
Weep if BEY. 
We obtain this by applying the following lemma, taking r = 0, n = qr(¢@), 
and an arbitrary pe 1, (note that I, # @). 
3.2 Lemma. Let (I,),, y: UW 2, 23. Then for every formula @: 
If pEL>, qr(p) In, pel,, anda,, ...,a,, €dom(p), then 


(*) 
WE pfdo,.--. 4,1] iff BE Pl Pao), --., Par— 1). 
Informally, 3.2 says that partial isomorphisms from I, preserve formulas 
of quantifier rank <n. It makes precise the idea discussed in 1.2(e) that 
formulas with quantifiers are preserved under partial isomorphisms provided 
these isomorphisms admit certain extensions. 


PROOF OF 3.2. We show (*) by induction on formulas g. Suppose @ € L°, 
qr(~) Tn,pel,, anda,, ...,a,—; € dom(p). 


(i) For atomic ¢ the result was proved in 1.2(c). 
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(ii) If @ = 4w then 


WE wlag,.-., a,-,] iff not WE WLhdo,---,d,-1] 
iff not 2B = WLp(4o), ..., Par-1)] 
(by induction hypothesis) 


iff BE @Lplao),..., P(4,-1)1- 


(iii) For p = Wo v Wy, the argument is analogous. 
(iv) Suppose m = 4xw. Since ge L5, v, does not occur free in g. Thus 


v, 
Ixy — dv, w~—, and therefore we may assume that x = v,. Because 
x 


qr(@) = qr(dxy) I n, we have qr(v) In — 1. The claim for @ is now 
obtained from the following chain of equivalent statements: 


(a) YW - plag, see a,—]. 

(b) There isa € A such that YE wWfao,..., 4-1, a]. 

(c) There is ac A and qé/,_, such that q>p, a¢dom(q), and WE 
Wlao, »++5 Gyp—45 a}. 

(d) There is ac A and qéI,_, such that q > p, a¢dom(qg), and BE 
WLp(ao), fate p(a,—1); q{a)]. 

(e) There is b€ Band q € /,_, such that q > p, b € rg(q), and 


B fe W[p(ao), 239 pla, - 1), b}. 


(f) There is b€ B such that 8 & W[p(ao), ..., p(a,—1), Db]. 
(g) BE glpao),..., Par-1)]. 


To prove the equivalence of (e) and (f) and of (b) and (c), respectively, one 
uses the back- and forth-properties of the sequence (I), . The equivalence 
of (c) and (d) follows from the induction hypothesis. 


From the foregoing proof we extract another result needed in the next 
chapter. Two structures 2f and 23 are m-isomorphic (written: YW ~,,B) if 
there is a sequence [y,..., | of nonempty sets of partial isomorphisms from 
QI to 23 with the back- and forth-properties, 1.¢., 


forn +1 <m,pél,,, and a€A (resp. D&B), there is qel, 
such that q > pand a € dom(gq) (resp. b € rg(q)). 


In this case we write (L),,,: YW =,, 23. 
Since the proof of 3.2 shows that partial isomorphisms in I, preserve 
formulas of quantifier rank Im, we have 


3.3 Corollary. If 2 =,,23 then Yf and 23 satisfy the same sentences of 
quantifier rank <m. LI 


The following considerations are needed for the converse of 3.1. 
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Given a set ® of S-formulas we write <®>, as in VIII.3, to denote the 
smallest subset of L* which contains ® and is closed under propositional 
connectives (i.e., which contains together with wand y also TWand (w v {)). 

It is easy to verify: 


(1) If every formula in ®, is logically equivalent to a formula in ®;, then 
every formula in ¢®, > is logically equivalent to a formula in (0,). 


By induction on @ one can show: 
(2) If pe Land ar(g) <n + 1 then 
pe (ye L?lar(y) <n} v {Ax we Lilqr() <n}. 
From (1) and (2) we obtain 
3.4 Lemma. Let ne N. Then for ever?;r € N there are, up to logical equiv- 
alence, only finitely many formulas in L> of quantifier rank <n. 
ProoF. By induction on n. 


n =(: Since S was assumed to be finite and relational, there are only 
finitely many atomic formulas in LS. By VIII.3.2 there are, up to logical 
equivalence, only finitely many quantifier-free formulas in L?. 


Induction step: Let r € N. By induction hypothesis there are formulas 


Wo. ---W,—-1 € L® of quantifier rank <n 
and formulas 
Viiuhae png Lo 4 OF quantifier rank <n 


such that every formula in L? (resp. L5, ,) of quantifier rank <n is logically 
equivalent to some w; (resp. x;). We show: 


(*) Every formula in 


®, = {We Lilqr(w) < n} vu {3x We Li lary) < n} 


is logically equivalent to a formula in 


D, = {Wos---5 Wy-1t U (db, Xo, ++ Jv, Xn-1}- 


From this we obtain the claim in the induction step as follows: Every formula 
in LS of quantifier rank <n + 1 is in ¢<®,> (cf. (2)) and is therefore, by (*) 
and (1), logically equivalent to a formula in (0,). But by VIII.3.2 <®,> 
contains only finitely many formulas which are pairwise logically non- 
equivalent. 


Proof of («): By choice of the wy, every yé L? with qr(W) < 1 is logically 
equivalent to some y,. If 3x € L and qr(w) < n, then — Ixw — Jo, 
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The formula yo is in L3,, and has quantifier rank <n, hence is logically 


equivalent to some x;; thus 4x w is equivalent to 3v, y;. = 
We conclude the proof of Fraisse's theorem by showing 


3.5 Lemma. If YW = 2 then UW =, B. 
PRoor. Suppose YJ = 8. For nEN we define I, as follows (cf. (*) in 3.2): 


pel, iff pe Part(Ql, B) and there arere Nand dpy,...,4,-,€ A 
with dom(p) = {a,, ...,a,—-,} such that for all pe LS 
with qr(p) <n 


We gldy,...,4,-1} Mf BE —lplay),..-, p(a,-1)]. 


We show that (,)ney: Ul Zu B. 

(I,),, has the forth-property. For, suppose that pe I,+,,ae¢A and 
dom(p) = {ag,...,4,-1}. By 3.4 we can pick finitely many formulas 
Wore. WE Es of quantifier rank <n such that every formula in L*, , of 
quantifier rank <7 is logically equivalent to some w,;. 

For 0 <i < 5 we let 


=| vi, if WE Wilao,--., 4-14) 
Pj Vis. tf WE WwW ldese xs Gaya]: 


Then YW & dug aA... A @)[ao,.--, @,—1]. Since 


qr(dv,(Po A+++ A Q)) Snt+ 1 
and pe Z,,,,, it follows that 


8 - dv Po BS ae YS ~s)Lp(ao), a) p(a,—1)], 


say 23 Po A... A @,[P(ao), ---, P(a,_;), b]. Then in Wf and 23 the elements 
a,, -+-5@,—5, a and p(do),...,p(a,—;), b respectively, satisfy the same 
formulas among the w; and therefore satisfy the same formulas of quantifier 
rank <n. Thus p=q u {(a, b)) is a partial isomorphism which extends p 
(cf. 1.2(c)), has a in its domain, and lies in I,. 

The back-property is proved analogously. 

Finally, every I, is nonempty: Since % = 23 the same sentences of quan- 
tifier rank <n hold in YI as in 23, and therefore p = @ lies in I,. LJ 


If 2{ and 2 satisfy the same sentences of quantifier rank <m, the last 
argument in the preceding proof shows that p = @ is an element of Jo, 
I,,..., L. Summarizing, we have 


3.6 Lemma. /f 2 and 23 satisfy the same sentences of quantifier rank <m then 
Wx, B. L 


84. Ehrenfeucht Games 


Assertions 3.3 and 3.6 yield 


3.7 Theorem. Let S be finite and relational. For S-structures Y and 23, the 
following are equivalent: 


(a) Ua, B. 
(b) QI and 23 satisfy the same sentences of quantifier rank <™m. L 


3.8 Exercise. Suppose S = {P,,..., P,-,} with unary reiation symbols P;. 
Show that for every S-structure Wf and every m > | there is a structure 23 
such that YW =,, 23 and 2 contains at most m- 2’ elements. (Hint: Consider 
the 2" subsets of A of the form A, ™...A,_-,, where A, = P4 or A; = 
A — P#. Choose 2 to be a structure whose corresponding sets have the same 
number of elements if this number is <m, and otherwise have m elements.) 


3.9 Exercise. Again let S = {P,, ..., P,_,} with unary relation symbols P,, 
let m > 1, and g € L$ bea sentence of quantifier rank <m. Show: 


(a) If @ is satisfiable, then ¢ is already satisfiable over a domain with at most 
m- 2” elements. 


(b) {yw|we LS, w valid} is R-decidable. 


§4. Ehrenfeucht Games 


The algebraic description of elementary equivalence is well-suited for many 
purposes. However, it lacks the intuitive appeal of a game-theoretical 
characterization due to Ehrenfeucht, which we describe in the present 
section. 

Let S be an arbitrary symbol set and let Y and 23 be S-structures. To 
simplify the following formulation we assume A ™ B = (J. The Ehrenfeucht 
game G(Y, 8B) corresponding to Yl and 2 is played by two players, I and II, 
according to the following rules: 


Each play of the game begins with player I choosing a natural number 
r > 1; ris the number of subsequent moves each player has to make in the 
course of the play. These subsequent moves are begun by player I, and both 
players move alternately. Each move consists of choosing an element from 
A u B. If player I chooses an element a; € A in his ith move, then player [I 
must choose b,; € B in his ith move. If player I chooses an element b; € B in 
his ith move, then player II must choose a; € A. After the rth move of player 
II the play is completed. Altogether some number r > 1, elements a,, ..., 
a, € A and b,,...., 0, € B have been chosen. Player II has won the play iff by 
p(a;) = b, for i = 1,..., ra partial isomorphism from YW to 23 is defined. 

We say that player II has a winning strategy in G(QI, 23) and write “IT wins 
G(2, 8)” if it is possible for him to win each play. (We omit an exact defini- 
tion of the notion of winning strategy.) 
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4.1 Lemma. YI = , 2 iff II wins G(Q, B). 


This lemma, together with Fraisse's theorem, yields the desired game- 
theoretical characterization of elementary equivalence: 


4.2 Ehrenfeucht's Theorem. For finite S and arbitrary 2 and 8: 
W= 8 iff IL wins GQ, 23). 
PROOF OF 4.1. Suppose (J, ney: UW =, B. Then Urey: UW =, B also, where 
I,:= {p|there is q € 7,, such that p € q). 


We describe a winning strategy for player I: 

If player I chooses the number r at the beginning of a G(Q, 23)-play, then 
for i = 1,...,r player IT should choose the elements a; (or resp. b;) so that 
by p{a,) = 6; for | <j <i one obtains a partial isomorphism p; in /,_;: 
this is always possible because of the extension properties of partial iso- 
morphisms in (J},),-,y- For i = r it follows that player {1 wins the play. 

Conversely, suppose that player II has a winning strategy in G(Y, 23). 
We define a sequence (J,),-. as follows: 

For née N let 


pel, iff pe Part(QI, 23) and there are jE N and a,,...,a 

such that 

(i) dom(p) = {a,,...,4j}; 

(ii) there isan m > nand a G(YI, 23)-play which II plays 
according to his winning strategy, which player I 
opens by choosing the number m + j, and where in 
the first j moves the elements a,,...,a;¢€A and 
P(a,),.-., p(a;) € B are chosen. 


;E A 


Jj 


From the rules of the game we immediately obtain that (1,),,,  : WU - 23. L 


CHAPTER XII 
Characterizing First-Order Logic 


In this final chapter we present some results, due to Lindstrom [21], which 
we have already mentioned several times. They show that first-order logic 
occupies a unique place among logical systems. Indeed we shall prove: 


(a) There is no logical system with more expressive power than first-order 
logic, for which both the compactness theorem and the Lowenheim- 
Skolem theorem hold (§3). 

(b) There is no logical system with more expressive power than first-order 
logic, for which the Lowenheim-Skolem theorem holds and for which the 
set of valid sentences is enumerable ($4). 


1 Logical Systems 


In the definition of a logical system which follows, we collect several proper- 
ties which are shared by the logics we have considered so far. As we are mainly 
interested in semantic aspects we shall speak of a logical system as soon as 
we have the following: We are given, for every symbol set S, an "abstract" 
set whoseelements play the réle of S-sentences, and in addition, a relationship 
between structures and such sentences which corresponds to the satisfaction 
relation. and determines whether an "abstract" sentence holds in a structure. 


1.1 Definition. A logical system £& consists of a function L and a binary 
relation E ». L associates with every symbol set § a set L(S), the set of’S- 
sentences of &. The following properties are required: 


(a) If So Cc S; then L(So) —_ L(S,). 


(b) If We» ~@ (ie. if Wand gare related under — y), then, for some S, W 
is an S-structure and ge€ L(S). 
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(c) (Isomorphism property) If 2 ¢ g@ and UW = Bthen BE vo. 
(d) (Reduct property) If So < S,, pe L(S,), and WY is an S,-structure then 


We, iff UWPSoee¢”. 


Li,2L£ 5, Li, Lo, and Lg are logical systems. For instance, in the case 
of &, we choose L, to be the function which assigns to a symbol set S the set 
L\(S):=L of first-order S-sentences, and we take - y, to be the usual 
satisfaction relation between structures and first-order sentences. 

If ¥ is a logical system and yeé L(S), let 


Mod$,(¢) := {219 is an S-structure and YW — y op}. 


In case S is clear from the context we just write Mod ,(¢). 

Mod%(@) can be regarded as a mathematically precise counterpart to 
the meaning of «. It suggests the following definition of when a logical system 
¥’ has more expressive power than ¥, namely, if for every 9-sentence @ 
there is an 9'-sentence w with the same meaning. 


1.2 Definition. Let Y and & be logical systems. 


(a) ¥’ is at least as strong as ¥ (written: Y < &’) iff for every S and every 
go € L(S) there is a w € L'(S) such that 


Mod*®(@) = Mod%(). 
(b) and ¥' are equally strong (written: Y ~ #)ifffHY TI #’and #' < #. 


EXAMPLES. #7, I #4; £y I Ly; not fy < 9 (cf. 1X.1.7); Li < 
(cf. IX.2.7); not Yy < #g (proof !) and not Vg < Ly. 


1 


On our abstract level we now formulate some properties of logical 
systems which are known to hold for the systems we have considered so far. 


Boole(#) (* £ is closed under propositional ("Boolean") connectives") if 
(1) and (2) are satisfied. 


(1) Given Sand @gé L(S), there is a y € L(S) such that for every S-structure UW: 
Weyyx iff notWeye. 
(2) Given Sand o, ¥ € L(S), there is a y € L(S)such that for every S-structure 
QT: 


If Boole(&) holds then let — mand (@ v y)stand for formulas x in the sense 
of (1) and (2) above. (g A $), (g > $), ... are used analogously. 


Rel(£) (’ ¥ permits relativization", cf. VIII.2): 


For S, ge L(S), and unary U there is a YE L(S u {U}) such 
that 


(UW U4)e,w iff (Ul"Hsv¢e 
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for all S-structures QU and all S-closed subsets CI‘ of A. ((U*]" 
is the substructure of 21 with domain CI’.) 


If Rel(#) holds let g” be a formula & with the above property. 
Elim(#) (* # allows elimination of function symbols and constants"): 


If S is a symbol set and S' is chosen as in VIII.1, then for any 
@ € L(S) there is a w € L(S‘) such that for all S-structures QI: 


Weve iff WE ow. 


(For the definition of Ql” cf. also VIII.1). If Elam(#) then we write @” for a 
formula yw with the above property. 


1.3 Definition. A logical system & is said to be regular if it satisfies the 
properties Boole(#), Rel(#), and Elim(#). 


All logical systems which we have hitherto considered are regular. In the 
case of &, we verified Elim(#,) and Rel(#,) in VHI.1 and VIII.2. The 
arguments given there can also be applied without difficulty for the other 
logical systems. 

We tacitly adopt some semantic notions whose definition can be extended 
from £&, to the general case in a straightforward manner. For example, 
~ € L(S) is said to be satisfiable if Mod’.(g) # @, and valid if Mod4,(q) is 
the class of all S-structures. If ® < L(S) then DE y@ means that every 
model of ® (in the sense of — ,) is a model of gw. Note that these definitions 
refer to a fixed symbol set S. However, using the reduct property 1.1(d) one 
can argue as in EI.5.3 to show that they do not depend on S. In the sequel 
similar applications of 1.1(d) will be made without explicit mention. 

We introduce the following abbreviations: 


LéSko(#) ("The Lowenheim-Skolem theorem holds for #”): 
Every satisfiable sentence of & has an at most countable 
model. 


Comp(£#) ("The compactness theorem holds for #”): 
If P is a set of sentences of ¥ such that every finite subset of ® is 
satisfiable, then ® is satisfiable. 


In this terminology the result of Lindstrom mentioned in (a) reads as follows: 


Let # be a regular logical system such that #, < ¥Y, 
L6Sko(#), and Comp(#). Then ¥ ~ ¥). 


Before embarking on the proof (in §3) we derive in the next section some 
properties of logical systems for which the compactness theorem holds. 


1.4 Exercise. Let & be given by 


(i) L(S):={@|@ is an L§-sentence of the form 3X9...4X,—,W, where 
does not contain a second-order quantifier}. 
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(ii) For g € L(S) and any S-structure YW, 


Weve iff Ceo g. 
Show: 


(a) # is a logical system. 

(b) L6Sko(#), Comp(#), Rel{(#), and Elim(#). 
(c) Not Boole(#). 

(d) ¥, < ¥ but not ¥ < ¥,. 


§2. Compact Regular Logical Systems 


In this section ¥ will always be a regular logical system such thut #; < /%. 
For a first-order S-sentence q, let y* be a sentence in L(S) which has the 
same models as @, L.¢., 


Mod‘,,(~) = Mod3(¢*). 


For a set ® of first-order S-sentences define ®* := {w*|g € PB}. 

If ¥ is compact, i.e., Comp(#) holds, then also the compactness theorem 
for the consequence relation holds, as can be shown similarly to the first- 
order case: 


2.1 Lemma. Suppose Comp(#), and let ® u {g} < L(S) and ® — yg @. Then 
there is a finite subset Dy of ® such thut OE y «@. 


Proor. Choose @ by Boole(#). Then ® VU {7g} is not satisfiable. By 
Comp() there is a finite subset Py of so that ®, u —ep} imnot satisfiable, 
Le., D, Fv @. ] 


As a further property of compact logics, we show that the meaning of an 
L(S)-sentence only depends on finitely many symbols from S: 


2.2 Lemma. Suppose Comp(#) and yw € L(S). Then there is a finite subset 
So of S such that for all S-structures Qf and B, 


PROOF. We restrict ourselves to the case where S is relational (the case we 
shall subsequently need). There is no difficulty in extending the proof to 
arbitrary symbol sets. 

Choose new unary symbols U, V, and f. Let ® consist of the following 
first-order S u {U, Vf }-sentences, which say that f is an isomorphism 
between the substructure induced on U and the substructure induced on V: 


dxXxU x, ax VX, 
Vx(Ux > Vx), Vy¥Vy > Ax(Ux a fx = y)), 
Vx VWyh(Ux aA Uy a fx = fy) > x =)), 
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and for every Re S, R n-ary, 
Vi gice Vid CONG Avene A UR og) ORN Ges Kay SR Xo ce f Mpeg d)- 
Then, firstly, 
(1) OEY Woy”, 
In fact, if 2f is an S-structure and (WU, U4, V4,f 4) & ¢ ®*, ie., (MU, U4, V4F%) 


k= @, then U4 and V% are nonempty and f“ [ U4 is an isomorphism from 
[U*}" to [V4]™. By the isomorphism property (cf. 1.1(c)) we have 

[U*Peew if (VAP ee, 
that is, by Rel(#), 

QWU)e ow if WV) ey" 

Using the reduct property and Boole(&) we obtain 

(QI, Us, V4, f “) ae a= yr. 
Thus (1) is proved. By Comp() there is a finite subset Dy of D such that 
(2) Dey oy’. 
Since ®, consists of first-order sentences, we may choose a finite subset So 
of S such that Bp consists of So-sentences. We show that So has the desired 
properties. Suppose YW and 8 are S-structures and n: YW [So = BP So, 
where we assume A n B = @. (Otherwise we can take an isomorphic copy 


of 8 and use the isomorphism property.) We define over C:=A u B an 
Su {U, V, f )-structure (€, US VS, f ©) as follows (note that S is relational): 


R°:=R' uR® forReS, 
US := A, 
Vo =B, 
f© such that f° [US =n. 
ie (€, US, VS, £9 is a model of Do, ie. (C, US, VS, ff) & gv BF. Hence by 
(2), 
7 (CULV SIR eyo", 
and therefore, using [US]" = % and [V°]* = &, 
Wev,w if BE,YW. L] 


Two S-structures 2 and 8 are said to be 2-equivalent (written: YU = , 8), 
if for all w € L(S), 
Weyw iff BEyYwW. 


For WY = y, 8B we continue to write simply YW = B. Clearly, if Y ~ Y, then 
Q{ = B implies YW = ,%B. We show that the converse holds for compact O- 
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2.3 Lemma. Assume Comp(-%’) und suppose that U = B implies N=yB 
for arbitrary U, 23. Then 2 ~ Y,. 


PRoor. Since #; < &, given S and we L(S), we must show that there is a 
first-order S-sentence ~ such that 


Mod »,(~) = Mod (W) 
Suppose yw is satisfiable. (Otherwise we let g :=Vx 1x = x.) First, we claim 


For every 2 € Mod,($) there is @y € L3 such that 


We oy and gery. 
To show this we let 2f€ Mod,($). Then for the first-order theory Th(2I) 
of QT, 
Th(Q)*  ¢ w. 


For if 8 & vy Th(QQD*, 1e., B & Th, then 8 = Y by hypothesis; therefore 
8B =, and hence BE yw. 

Since Comp(#), there are r and @o,...,g,¢€Th(Ql) such that 
{pg,..., QFE ¢ w We set dy:= Goa... AQ, Then gyé THAD, ie., 
YE my, and py Ey W. Thus (1) is proved. 

From (1) we immediately obtain 


(2) Mody(y)= |) Mod,(ei). 


Me Mod ¢ (W) 
Now we show that there are W%,,..., 21,, € Mod,($) such that 
(3) Mod Aw) = Mod ,(g%,) u... u Mod,(o%,). 
Otherwise, for arbitrary W),..., WI, from Mod,($) we would have 
Mod AW) 2 Mod ¢%,) U --- U Mod (93,,), 


and thus every finite subset of {$} u {7@%|2l © Mod,($)} would be 
satisfiable. By Comp(#’) the whole set would be satisfiable, in contradiction 
to (2). 

Writing Mod,$) as in (3) we obtain 


Mod WW) = Mod y(gq,) U «+» U Mod v,(@u,,) 
= Mody,(@u, V +++ V Gu,,,): 
Hence for @ := @y, V --. V Py, we have Mod 4,(~) = Mod,($). L 


§3. Lindstrom's First Theorem 


We now have all tools available to obtain the following characterization of 
first-crder logic. 
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3.1 Lindstrom's First Theorem. Let ¥ be a regular logical system such that 
L, < ¥. Then 


L6Sko(#) and Comp(#) imply ¥ ~ ¥,. 


PRooF. Given & satisfying the conditions of the theorem we must show 
that ¥ < I By 2.3 it is sufficient to prove for all S: 


(+) For all S-structures YW and 23, if YW = 23 then W =z 23. 


We can restrict ourselves here to relational symbol sets. Assuming that (+) 
has been established for relational symbol sets, we can then give the following 
argument for arbitrary S: 

Suppose 2 = 23. Considering S’, 2", and B" we obtain, first of all, UW" = B" 
(cf. VII.1.7). Then for the relational symbol set S’, (+) yields YW" =, 8B". 
Finally, using Elim(&), we have for arbitrary yw € L(S): 


We,yw iff We, 
iff BE SY (since UW =. 8B") 


and thus YW =, 23. 
For relational § we now prove (+). Assume, for contradiction, that we 
have S-structures Yf and 8, and yw € L(S) such that 


(1) YW = 8, WEywyW, BE, TW. 


Corresponding to yw we choose a finite subset S, of S as in lemma 2.2, so that 
the meaning of y% only depends on the symbols in Sp. 

Since YW [Sp = BPS, holds, YW [So and B [So are finitely isomorphic 
by Fraissé’s theorem, and hence for suitable (1,),,,. we have 


(2) Uinen: UP So = BPSo, We > W, BE, 1. 


The central idea of the proof is to apply Comp(#) and L6Sko(.) to obtain 
structures 20’ and 8’ which are at most countable and for which 


(3) Whs,=,B lS. Wey, and BE, TY. 


Once we have (3) we get the desired contradiction: The at most countable, 
partially isomorphic structures YI’ }S, and 23' [Sg are isomorphic (cf. 
XI.1.5(d)). Hence by the choice of Sg 


Weewy iff BeEyyY, 


and this contradicts W' = » Ww and 23 = » Ty of (3). 

In order to proceed from (2) to (3) we give a suitable "description" of (2) 
in ¥&. We may assume that A n B = & for YW, B in (2) (otherwise take an 
isomorphic copy of 23). Let the symbol set $* be formed from S by adding 
the following new symbols: a unary function symbol f and relation symbols 
P,U,V (unary), <, I (binary), and G (ternary). We define an S+-structure € 
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in which (2) can be suitably described. In particular © includes YI and $3 and 
also contains the partial isomorphisms of the I, in (2): Set 


(a) C=AuBu Nat eegty: 
(b) US = A and [UT]® = W: 
(c) V° = Band [V°]®5 = $3; 


((b) and (c) are possible since A n B = @ and since S is relational.) 


(d) <° the natural ordering relation on N and {© { N the predecessor func- 
tion on N, ie., f“(n + 1) = n and,f©(0) = 0; 

(ce) pe = nen Le 

(f) [‘np iffne N and pel,; 

(g) G°pab iff P“p, a € dom(p) and p(a) = b. 


This is illustrated in Fig. 12.1 


Figure 12.1 


© is then a model of the conjunction y of the following finite set of sentences 
of L(S*). (Since Y,; < ¥ we use first-order sentences as an intuitive notation 
for the corresponding sentences of #.) 


(1) Vp(Pp > Vx Vi(Gpxy — (Ux a Vy))). 
(ii) Vp(Pp > Vx Vx' Vy Vy'(Gpxy a Gpx'y') > (x = x' oy = y’))). 
(iii) For every R € So, R n-ary: 
Vp(Pp > VXq ++. VXn—1 Vo +--+ V¥n-1(GPXoVo A «+. A GPXn—1Vn-1) 
(Rx, or Xn-1 = Ryo SEG Vn ~1))). 
(In effect (i), (41), and (111) say that for a fixed p € P, Gp. . describes the graph 


of a partial isomorphism of the Sp-substructure induced on U to the So- 
substructure induced on V.) 
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(iv) The axioms of ®,,,4 (for partially defined orderings) and 


Vx(dy p<xno(fx <x a Td2fx<zaz<-x))) 


(1.e., (field <, <) is an ordering with predecessor function f"). 
(v) Vx(4y(x < y v y < x) — Sp(Pp a Ixp)) (ie., if x is in the field of < 
then I, = {p|Pp a Ixp} is not empty). 
(vi) Vx Vp Vu((fx <x a Ixp A Un) 
— dg dvoUfxq a Gquv A Vx’ Vy(Gpx'y’ + G@x'y’))) 
(the forth-property). 
(vii) An analogous sentence for the back-property. 
(viii) Ix Ux a ap Vy a WY a (a)" (note that US = A, VS = B, Wee y, 
Bir» Ty). 


We choose a new constant c and write f°c,f ‘4c, f7c,... for the terms c,fc, 
Hfc,.... Then we let 


Y = {x} u {...ffc < fe <c), 


1.e., 
Saye es fee Ny. 


Every finite subset of ‘Y has a model, namely €’ = (€, c“), where c“' is a 
sufficiently large natural number. By Comp(#) there is a model of ‘P, say 
(D, c”). 

Dcontains an infinite descendingchain, namely...(f*c)? <? (fc)? <?c”. 
For what follows we need a countable model with this property. LOSko(#) 
does not directly help us here because it only applies to single sentences, 
whereas ‘Y is an infinite set of sentences. We circumvent this difficulty 
using a new unary relation symbol Q: Let 3 be the L(S* u {c, Q})-sentence 


3 = Qc a Vx(Qx > (fx < x a Qfx)) 


("Qcontains c and every element of Q possesses an immediate <-predecessor 
which also belongs to Q"; thus Q is a subset of the field of <). 
If QP = {(f"c)?|n © N} then 


CD; on Q”) F vx A 3, 


Le., % A 3 is satisfiable. Therefore, by LOSko(#), there is an at most 
countable model (€, c£,Q") of y a 3. Since (viii) holds in ©, U* ¥# @, 
V® # @,and because is relational, U and V* are domains of substructures. 
We set 


») @ _ EUa Te. 3B’ a yey 


and show that the at most countable structures QI’ and 8’ satisfy the condi- 
tions in (3). By (viii), E& gw" and E€ = ¥(4y)", and from this we obtain 


(4) Wey, BE, ay. 


202 XII. Characterizing First-Order Logic 


From (i), (ii), and (iii) we know that to every p € P* there corresponds a 
partial isomorphism from Yl’ f Sy to B’ | S,, which we also denote by p. 

Since (, c®, 0") & 9, for every n € N the element e, :=(f"c)* belongs to 
Q* and the e, form a descending chain ...e, <"e, <*e, <* ep. Let 


I :-={p|there is an n such that I*e, p}. 


Using (v) we see that J] # @, and using (vi) and (vii), that 7 has the back- 
and forth-property. For example, to verify the forth-property one can reason 
as follows: if pé J, say I*e,p, and ae A’ = U*, then by (vi), there is q such 
that I’e,+ ,q (thus g €I),q > p,and a €dom(q). To summarize, we have 


(5) 1: WPS. =, Bi P So. 
(4) and (5) yield (3) and thus 3.1 is proved. a 


By closer inspection of the above proof we can obtain the following lemma, 
which we shall need in the next section. 


32 Lemma. Let ¥ he a regular logical system such that ¥,< ¥ and 
LOSko(¥). Assume that for some finite set S of relation symbols there is 
I) € L(S) such that for every m € N there are S-structures M,, and B,, with 


(*) WU, =m Bm, Ws, E ew and Bn E,W. 


Then in £&, the class of finite orderings can be described in the following sense: 
There is a jinite S, containing symbols <,c and a sentence x; € L(S,) such 
that (a) and (b) hold: 


(a) If We yy, then (A, <“) is a partially dejined ordering and c4 is an 
element of the field with only finitely many <4-predecessors. 

(b) For every méN there is a model Y of 7, in which cA has at least m <*-pre- 
decessors. 


PRooF. Let S, I) be given as above. Choose x and c as in the preceding proof 
(taking now Sy = S) and set 7; = y A” c is in the field of <”. 

We first prove (b). For a given m, let Y,, and %,, be as in (*) and let 
(1,),,,. 1, = B,,. Define © as in the proof of 3.1 except for the following 
obvious modifications: 


(i) <© is the natural ordering on {0,...,m); 
(ii) aia —- Pee I; 


For c©:=™m the structure (6,c°) is a model of y,, and c© has m <‘- 
predecessors. 


PROOF OF (a): Suppose there is a model (D,c”) of y, in which c? has 
infinitely many <?-predecessors. From (D,c”) we can proceed as in the 
proof of 3.1 to obtain isomorphic structures QU’ and B’ such that QU’ — I) and 
8’ — —1/), a contradiction. LO 
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Lindstrom's theorem 3.1 characterizes first-order logic in the following 
sense: Among the regular logical systems there is none of greater expressive 
power which still satisfies the compactness theorem and the Léwenheim-— 
Skolem theorem. 

If one considers the defining properties of regular logical systems &, the 
properties Rel(#) and Elim(#)do not seem as fundamental as the others. An 
analysis of the proof of 3.1 shows that both these properties were used to speak 
about two structures Yf and 8 by placing them together in the structure €. 
There are alternative properties that can be used for the same purpose (cf. [2 ]). 
But if there is no substitute at all for Rel(#) and Elim(™), then there are 
counterexamples to 3.1. 


§4. Lindstrom's Second Theorem 


In our considerations of logical systems we now pay special attention to 
syntactic aspects. In this connection we recall the following properties of 
first-order logic: For a decidable symbol set S 


the S-sentences are concrete finite symbol strings and the set of S-sentences 
is decidable; 


Operations such as negation, relativization, and the elimination of 
function symbols can be carried out effectively; 


there exists an adequate proof calculus, and therefore the set of valid 
S-sentences is enumerable. 


We shall consider these aspects for logical systems in general, thereby 
arriving at the concept of an effective logical system. Within this framework 
we can then formulate and prove the result of Lindstrom mentioned in the 
introduction to this chapter under (b). 

When speaking of a decidable set we understand it to be a set of words 
over a suitable alphabet which is R-decidable in the sense of X.2.5. 


4.1 Definition. Let & be a logical system. # is called an effective logical 
system if for every decidable symbol set S the set L(S) is decidable, and for 
every yo € L(S) there is a finite subset Sg of S such that @ € L(So). 


4.2 Definition. Let Y and &’ be effective logical systems. 


(a) Y Ser ¥’ iff for every decidable S there is a computable function : 
which associates with every @ € L(S) a sentence g* € L’(S) such that 
Mod(~) = Mod‘,(@*). 

(b) Y ~ ep’ iff 9 Depp 4H’ and 2" Ter &). 


Y,, Li, Ly, and Lg are effectivelogical systems, but #,,,,, is not. We have, 
for instance, 4, <.f Yas Ln Sen Lu- 
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4.3 Definition. Let & be a logical system. & is said to be effectively regular 
if Y is effective and if the following effective analogues of Boole(#), Rel(#), 
and Elim(#) hold. 

For every decidable symbol set S: 


(i) There is a computable function which assigns to every ge L(S) a 
formula “@, and, in addition, a computable function which assigns to 
any g and w € L(S) a formula (gy v $). (Here “4, for instance, denotes 
an L(S)-sentence w such that WE » wiffnot WE , w (cf. the formulation 
of Boole(.#) in 81). 

(ii) For every unary U, there isa computable function which associates with 
every y € L(S) a formula @”. 

(ii) There is a computable function which associates with every @ € L(S) a 
formula g’ € L(S’). 


SL), LY, £g are effectively regular logical systems. 
Let & be an effective logical system. We say that & is enumerable for 
validity if for every decidable S, the set 


VELS) Fe g} 


is enumerable. 

Clearly, if & has an adequate proof calculus then & is enumerable for 
validity. (Examples are #, and #v.) 

Lindstrom's second theorem tells us that no proper extension ¥ of Y, 
with LOSko(#) can have an adequate proof calculus. 


4.4 Lindstrom's Second Theorem. Let & he an effectively regular logical 
system such that #y Sen &. If’ LOSko(#) and if & is enumerable for validity 
then # ~ or L. 


Proor. Let ¥ satisfy the hypotheses of the theorem. We prove that # < orp, Ly 
in two steps. 
First we show 


(+) For every decidable S and for every yw € L(S) there is a first- 
order S-sentence ~ which has the same models as §. 


Then we shall prove that the transition from yw to @ can be carried out 
effectively: Given a decidable S, we shall set up a procedure which yields for 
every yw € L(S)a first-order S-sentence with the same models. 

Since ¥ is an effective logical system. we only need to give a proof of (+) 
for finite decidable S (cf. 4.1). We leave it to the reader to show, using Elim(#), 
that we can furthermore assume S to be relational. 


Thus we assume that S is decidable, finite and relational. As a first step 
we prove 
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4.5 Lemma. If for some w€ L(S) there is no jirst-order S-sentence with the 
same models as W, then for every mé N there exist S-structures Y,, and B,, 
such that 


(*) Wn Zi Om UW, E vW, Bn, Ey Ty. 


PRoor. yw is satisfiable, otherwise Mod,($) = Mody,(sv9 1» = vo). We 


proceed indirectly, assuming that for a suitable m € N and for all S-structures 
Qt and 23: 


(1) If 2 =, B thenQWe,wifBes,w). 


Let @o,.--, @, be, up to logical equivalence, the first-order S-sentences of 
quantifier rank <m (cf. XI.3.4). We have (cf. X1.3.7): 


(2) Uz, Biff fori=O0,...,k, QE o; iff BE o)). 


For an S-structure % let @ be the conjunction of the formulas in 
{9,;{0 <i < k, Wt ¢,}. Then, by (2), for arbitrary 8, 


(3) Uz, Biff BE Pa. 

Let @ be the disjunction of the (finitely many!) @y for which WE » y, Le., 
(4) o = \/{oy| WM S-structure, We y p}. 

We show: 


(5) Mod Cy) = Mod y,(9), 


and thus obtain a contradiction to our assumptions. Suppose first that B is 
a model of ¥. Then @y is a member of the disjunction in (4), and since B F oy, 
we have 8 t= @. Conversely, if 8 — g, there is an 2 such that WE » w and 
BE Py (cf. (4)). Then by (3), W =, 23, and finally by (1), BE wy. = 


To continue with the proof of (+), we assume that there is a sentence w 
in L(S) for which there is no first-order S-sentence with the same models, and 
aim for a contradiction. By the result 4.5 just proved there are, for every 
meéeN, S-structures Q,, and %,, such that YW, =,,8,,. Wnty w, and 
$B, ¢ Ww. Since the assumptions in 3.2 are fulfilled, there is, for some finite 
S,, a sentence x, € L(S,) which describes the finite orderings (as explained 
in 3.2). 

Weextend S$, by adding a new unary relation symbol W and consider the 
L(S, u {W})-sentence 


G:=y, a Ix Wx a Vx(Wx > x < ©) 
(note that #, < &). By the properties of y, (cf. 3.2) we have: 


(a) If W is an S, U {W}-structure such that We ,9 then W’% is finite and 
not empty. 

(b) For every m > 1 there isa model Y of 3 such that W* contains exactly m 
elements. 
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Thus as Y ranges over the models of 9, w4 ranges over the finite sets 
(isomorphism property!). We shall now see that we can use (a) and (b) 
together with Trahtenbrot's theorem to conclude that ¥ is not enumerable 
for validity, in contradiction to our assumptions. We argue as in the proof 
of the incompleteness of second-order logic (cf. X.5.5). 

By Trahtenbrot's theorem there is a decidable symbol set S, such that the 
set of fin-valid first-order S,-sentences is not enumerable. We may assume 
that S, is relational and disjoint from S, u {W). 

Let * bea computable function associating with every first-order S3- 
sentence @ a sentence gy* € L(S,) which has the same models. Then for 
o € L2? we have 


k 
(c) ¢ is fin-valid iff = ~9>(@ )” 


To prove this, we assume first that ¢ is fin-valid. If PI isan (S, u {W) uz S;3)- 
structure such that PI © ,.9, then W “ is finite by (a), and thus [Ww yets2 E © 
But then [W4]™'*? — , »*, and hence Yt = »(p*)”. The converse is obtained 
similarly by applying (b). 

Equivalence (:) enables us to obtain from an enumeration procedure 8 
for the set of valid L(S, u {W)u S;)-sentences an enumeration procedure 
& for the fin-valid first-order S,-sentences, thus yielding a contradiction to 
Trahtenbrot's theorem. & proceeds as follows for n = 1, 2, 3,...: the first 
(lexicographically) # sentences @o,..., Y,—, from L,(S,) are generated, and 
the L(S, u {W} u S,)-sentences 9 + (p%)",...,9 + (@*_,)" are formed. 
(Note that the map * is computable and that the operations of relativization 
and implication are effective.) Then, using $8, one generates the first n valid 
L(S, u {W}u S,)-sentences, listing those g; for which ¢ > (@*)” occurs. 
This finishes the proof of (+). 

Now, given a decidable S, we describe an effective procedure which 
associates with every sentence yw e€ L(S)a first-order sentence @ with the same 
models. Let $8 be an enumeration procedure for the set of valid L(S)-sentences, 
and “a computable function which assigns to every first-order S-sentence 
y an L{S)-sentence 7* with the same models. 

Given yw € L(S), proceed as follows: For n = 1, 2,3,... use 8 to generate 


the first n valid sentences Wo,..-.,W,—, from L(S); then generate the first 
(lexicographically) n sentences @o,..., @,—, from L,(S), and finally, form 
the L(S)-sentences yw @%,...5% — @x_,. Check whether there are i, j such 


that w; = > @. By (+) this must eventually happen. Then let @; be the 
~ associated with w. 


Lindstrom's results initiated a series of investigations of properties of 
logical systems and relations between them, in a general setting (cf. [2]). In 
this way it is possible to bring important aspects of such properties into better 
perspective, thus gaining new insights into concrete logical systems and even 
into first-order logic. We illustrate this briefly, taking the compactness 
theorem as an example. 
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An ordering (A, <*) which contains no infinite descending chain 
...<4a, <4a, <4a, is said to be a well-ordering. All finite orderings are 
well-orderings, as are (N, <") and the ordering which results when (N, <‘Y) 
is extended by adding an isomorphic copy. On the other hand, (Z, <*) and 
(Q, <*) are not well-orderings. 

Let ¥ be a regular logical system such that 4, < #. A well-ordering 
(A, <“) is said to be accessible in ¥ (or 9-accessible) if there is an S with 
< €S and an L{S)-sentence yw such that 


(a) in every model 2B of w, (field <®, <®) is a well-ordering; 
(b) there is a model 8 of y such that (A, <*) < (field <*, <?). 


Since #,; < ¥, all finite well-orderings are 9-accessible. If # is compact 
then no infinite well-ordering is 9-accessible. For if a sentence yw has an 
infinite model A, where (field <4, <*) is a well-ordering, then one can show 
by a method similar to that used in exercise VI.4.11 that yw has a model 23 
in which (field <*, <*) has an infinite descending chain. 

If one assumes L6Sko(#) and strengthens the regularity conditions 
slightly, the following two statements are, in fact, equivalent: 


(i) not Comp(.#) 
(ii) (N, <™) is 9-accessible. 


These considerations motivate us to look beyond the simple dichotomy 
“Comp(#) — not Comp(#)”, and to make finer distinctions: the more 
2'-accessible well-orderings there are, the more the compactness theorem is 
violated. As a measure for the violation one can take the smallest well- 
ordering which is not 2-accessible, the so-called well-ordering number of 
£&.The study of well-ordering numbers has led to a series of fruitful investiga- 
tions (cf. [2]). In particular it has turned out that for certain logical systems 
one can use arguments involving the well-ordering number to compensate 
for the absence of the compactness property. 
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